Overview
Brought to you by YData
Dataset statistics
| Number of variables | 107 |
|---|---|
| Number of observations | 836209 |
| Missing cells | 36024469 |
| Missing cells (%) | 40.3% |
| Total size in memory | 682.6 MiB |
| Average record size in memory | 856.0 B |
Variable types
| Text | 107 |
|---|
Dataset
| Description | Naturalis Biodiversity Center (NL) - Botany 0061690-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.4ze7ns |
license has constant value "CC0_1_0" | Constant |
publisher has constant value "Naturalis Biodiversity Center" | Constant |
rightsHolder has constant value "Naturalis Biodiversity Center" | Constant |
institutionID has constant value "https://ror.org/0566bfb96" | Constant |
collectionCode has constant value "Botany" | Constant |
occurrenceStatus has constant value "PRESENT" | Constant |
sampleSizeValue has constant value "0.0 m" | Constant |
higherGeography has constant value "51.41942" | Constant |
latestEraOrHighestErathem has constant value "Bakker S" | Constant |
highestBiostratigraphicZone has constant value "2608920" | Constant |
identificationID has constant value "Physcia caesia (Hoffm.) Fürnr." | Constant |
identificationReferences has constant value "Fungi|Lichenes-Lecanoromycetes|Caliciales|Lichenes-Physciaceae" | Constant |
identificationVerificationStatus has constant value "Fungi" | Constant |
identificationRemarks has constant value "Ascomycota" | Constant |
taxonID has constant value "Lecanoromycetes" | Constant |
scientificNameID has constant value "Caliciales" | Constant |
parentNameUsageID has constant value "Physciaceae" | Constant |
taxonConceptID has constant value "Physcia" | Constant |
originalNameUsage has constant value "caesia" | Constant |
namePublishedInYear has constant value "SPECIES" | Constant |
subfamily has constant value "NL" | Constant |
tribe has constant value "2024-11-01T10:28:05.946Z" | Constant |
cultivarEpithet has constant value "true" | Constant |
verbatimTaxonRank has constant value "2608920" | Constant |
vernacularName has constant value "2608920" | Constant |
nomenclaturalStatus has constant value "180" | Constant |
taxonRemarks has constant value "10861608" | Constant |
elevation has constant value "2608920" | Constant |
elevationAccuracy has constant value "Physcia caesia" | Constant |
depth has constant value "Physcia caesia (Hoffm.) Fürnr." | Constant |
depthAccuracy has constant value "Physcia caesia (Hoffm.) Hampe ex Fürnr." | Constant |
typifiedName has constant value "NE" | Constant |
protocol has constant value "DWC_ARCHIVE" | Constant |
lastCrawled has constant value "2024-11-01T08:50:07.799Z" | Constant |
isSequenced has constant value "false" | Constant |
publishedByGbifRegion has constant value "EUROPE" | Constant |
otherCatalogNumbers has 626711 (74.9%) missing values | Missing |
eventDate has 143292 (17.1%) missing values | Missing |
startDayOfYear has 143292 (17.1%) missing values | Missing |
endDayOfYear has 143292 (17.1%) missing values | Missing |
year has 143292 (17.1%) missing values | Missing |
month has 176649 (21.1%) missing values | Missing |
day has 258263 (30.9%) missing values | Missing |
habitat has 686738 (82.1%) missing values | Missing |
sampleSizeValue has 836208 (> 99.9%) missing values | Missing |
higherGeography has 836208 (> 99.9%) missing values | Missing |
continent has 150752 (18.0%) missing values | Missing |
stateProvince has 512889 (61.3%) missing values | Missing |
locality has 123808 (14.8%) missing values | Missing |
verbatimElevation has 540040 (64.6%) missing values | Missing |
decimalLatitude has 483055 (57.8%) missing values | Missing |
decimalLongitude has 483055 (57.8%) missing values | Missing |
latestEraOrHighestErathem has 836208 (> 99.9%) missing values | Missing |
highestBiostratigraphicZone has 836208 (> 99.9%) missing values | Missing |
identificationID has 836208 (> 99.9%) missing values | Missing |
typeStatus has 822537 (98.4%) missing values | Missing |
identifiedBy has 693965 (83.0%) missing values | Missing |
dateIdentified has 763698 (91.3%) missing values | Missing |
identificationReferences has 836208 (> 99.9%) missing values | Missing |
identificationVerificationStatus has 836208 (> 99.9%) missing values | Missing |
identificationRemarks has 836208 (> 99.9%) missing values | Missing |
taxonID has 836208 (> 99.9%) missing values | Missing |
scientificNameID has 836208 (> 99.9%) missing values | Missing |
parentNameUsageID has 836208 (> 99.9%) missing values | Missing |
taxonConceptID has 836208 (> 99.9%) missing values | Missing |
originalNameUsage has 836208 (> 99.9%) missing values | Missing |
namePublishedInYear has 836208 (> 99.9%) missing values | Missing |
subfamily has 836208 (> 99.9%) missing values | Missing |
tribe has 836208 (> 99.9%) missing values | Missing |
genus has 13165 (1.6%) missing values | Missing |
genericName has 13241 (1.6%) missing values | Missing |
specificEpithet has 78237 (9.4%) missing values | Missing |
infraspecificEpithet has 778925 (93.1%) missing values | Missing |
cultivarEpithet has 836208 (> 99.9%) missing values | Missing |
verbatimTaxonRank has 836208 (> 99.9%) missing values | Missing |
vernacularName has 836208 (> 99.9%) missing values | Missing |
nomenclaturalStatus has 836208 (> 99.9%) missing values | Missing |
taxonRemarks has 836208 (> 99.9%) missing values | Missing |
elevation has 836208 (> 99.9%) missing values | Missing |
elevationAccuracy has 836208 (> 99.9%) missing values | Missing |
depth has 836208 (> 99.9%) missing values | Missing |
depthAccuracy has 836208 (> 99.9%) missing values | Missing |
distanceFromCentroidInMeters has 833143 (99.6%) missing values | Missing |
issue has 776215 (92.8%) missing values | Missing |
mediaType has 57645 (6.9%) missing values | Missing |
genusKey has 13165 (1.6%) missing values | Missing |
speciesKey has 78171 (9.3%) missing values | Missing |
species has 78171 (9.3%) missing values | Missing |
typifiedName has 836208 (> 99.9%) missing values | Missing |
gbifRegion has 151640 (18.1%) missing values | Missing |
level0Gid has 497950 (59.5%) missing values | Missing |
level0Name has 497950 (59.5%) missing values | Missing |
level1Gid has 499035 (59.7%) missing values | Missing |
level1Name has 499035 (59.7%) missing values | Missing |
level2Gid has 501994 (60.0%) missing values | Missing |
level2Name has 501999 (60.0%) missing values | Missing |
level3Gid has 695853 (83.2%) missing values | Missing |
level3Name has 697659 (83.4%) missing values | Missing |
iucnRedListCategory has 73123 (8.7%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
Reproduction
| Analysis started | 2025-01-08 23:38:45.321100 |
|---|---|
| Analysis finished | 2025-01-08 23:39:25.846555 |
| Duration | 40.53 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 836209 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 836209 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2514633172 |
|---|---|
| 2nd row | 2980371442 |
| 3rd row | 2514602651 |
| 4th row | 2980366433 |
| 5th row | 2514610075 |
| Value | Count | Frequency (%) |
| 2514633172 | 1 | < 0.1% |
| 2980380439 | 1 | < 0.1% |
| 2980369451 | 1 | < 0.1% |
| 2514646162 | 1 | < 0.1% |
| 2980370447 | 1 | < 0.1% |
| 2514602651 | 1 | < 0.1% |
| 2980366433 | 1 | < 0.1% |
| 2514610075 | 1 | < 0.1% |
| 2980364432 | 1 | < 0.1% |
| 2516414075 | 1 | < 0.1% |
| Other values (836199) | 836199 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 1484799 | |
| 2 | 1326518 | |
| 1 | 1318720 | |
| 4 | 696797 | |
| 6 | 681549 | |
| 3 | 673136 | |
| 7 | 644384 | |
| 0 | 519930 | 6.2% |
| 8 | 508910 | 6.1% |
| 9 | 507347 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8362090 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1484799 | |
| 2 | 1326518 | |
| 1 | 1318720 | |
| 4 | 696797 | |
| 6 | 681549 | |
| 3 | 673136 | |
| 7 | 644384 | |
| 0 | 519930 | 6.2% |
| 8 | 508910 | 6.1% |
| 9 | 507347 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8362090 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 1484799 | |
| 2 | 1326518 | |
| 1 | 1318720 | |
| 4 | 696797 | |
| 6 | 681549 | |
| 3 | 673136 | |
| 7 | 644384 | |
| 0 | 519930 | 6.2% |
| 8 | 508910 | 6.1% |
| 9 | 507347 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8362090 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 1484799 | |
| 2 | 1326518 | |
| 1 | 1318720 | |
| 4 | 696797 | |
| 6 | 681549 | |
| 3 | 673136 | |
| 7 | 644384 | |
| 0 | 519930 | 6.2% |
| 8 | 508910 | 6.1% |
| 9 | 507347 | 6.1% |
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0_1_0 |
|---|---|
| 2nd row | CC0_1_0 |
| 3rd row | CC0_1_0 |
| 4th row | CC0_1_0 |
| 5th row | CC0_1_0 |
| Value | Count | Frequency (%) |
| cc0_1_0 | 836209 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1672418 | |
| 0 | 1672418 | |
| _ | 1672418 | |
| 1 | 836209 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2508627 | |
| Uppercase Letter | 1672418 | |
| Connector Punctuation | 1672418 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1672418 | |
| 1 | 836209 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1672418 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1672418 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4181045 | |
| Latin | 1672418 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1672418 | |
| _ | 1672418 | |
| 1 | 836209 |
Latin
| Value | Count | Frequency (%) |
| C | 1672418 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5853463 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 1672418 | |
| 0 | 1672418 | |
| _ | 1672418 | |
| 1 | 836209 |
publisher
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Naturalis Biodiversity Center |
|---|---|
| 2nd row | Naturalis Biodiversity Center |
| 3rd row | Naturalis Biodiversity Center |
| 4th row | Naturalis Biodiversity Center |
| 5th row | Naturalis Biodiversity Center |
| Value | Count | Frequency (%) |
| naturalis | 836209 | |
| biodiversity | 836209 | |
| center | 836209 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 3344836 | |
| t | 2508627 | |
| r | 2508627 | |
| e | 2508627 | |
| 1672418 | 6.9% | |
| s | 1672418 | 6.9% |
| a | 1672418 | 6.9% |
| d | 836209 | 3.4% |
| C | 836209 | 3.4% |
| y | 836209 | 3.4% |
| Other values (7) | 5853463 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20069016 | |
| Uppercase Letter | 2508627 | 10.3% |
| Space Separator | 1672418 | 6.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 3344836 | |
| t | 2508627 | |
| r | 2508627 | |
| e | 2508627 | |
| s | 1672418 | |
| a | 1672418 | |
| d | 836209 | 4.2% |
| y | 836209 | 4.2% |
| v | 836209 | 4.2% |
| o | 836209 | 4.2% |
| Other values (3) | 2508627 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 836209 | |
| N | 836209 | |
| B | 836209 |
Space Separator
| Value | Count | Frequency (%) |
| 1672418 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22577643 | |
| Common | 1672418 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 3344836 | |
| t | 2508627 | |
| r | 2508627 | |
| e | 2508627 | |
| s | 1672418 | 7.4% |
| a | 1672418 | 7.4% |
| d | 836209 | 3.7% |
| C | 836209 | 3.7% |
| y | 836209 | 3.7% |
| v | 836209 | 3.7% |
| Other values (6) | 5017254 |
Common
| Value | Count | Frequency (%) |
| 1672418 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24250061 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 3344836 | |
| t | 2508627 | |
| r | 2508627 | |
| e | 2508627 | |
| 1672418 | 6.9% | |
| s | 1672418 | 6.9% |
| a | 1672418 | 6.9% |
| d | 836209 | 3.4% |
| C | 836209 | 3.4% |
| y | 836209 | 3.4% |
| Other values (7) | 5853463 |
rightsHolder
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Naturalis Biodiversity Center |
|---|---|
| 2nd row | Naturalis Biodiversity Center |
| 3rd row | Naturalis Biodiversity Center |
| 4th row | Naturalis Biodiversity Center |
| 5th row | Naturalis Biodiversity Center |
| Value | Count | Frequency (%) |
| naturalis | 836209 | |
| biodiversity | 836209 | |
| center | 836209 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 3344836 | |
| t | 2508627 | |
| r | 2508627 | |
| e | 2508627 | |
| 1672418 | 6.9% | |
| s | 1672418 | 6.9% |
| a | 1672418 | 6.9% |
| d | 836209 | 3.4% |
| C | 836209 | 3.4% |
| y | 836209 | 3.4% |
| Other values (7) | 5853463 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20069016 | |
| Uppercase Letter | 2508627 | 10.3% |
| Space Separator | 1672418 | 6.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 3344836 | |
| t | 2508627 | |
| r | 2508627 | |
| e | 2508627 | |
| s | 1672418 | |
| a | 1672418 | |
| d | 836209 | 4.2% |
| y | 836209 | 4.2% |
| v | 836209 | 4.2% |
| o | 836209 | 4.2% |
| Other values (3) | 2508627 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 836209 | |
| N | 836209 | |
| B | 836209 |
Space Separator
| Value | Count | Frequency (%) |
| 1672418 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22577643 | |
| Common | 1672418 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 3344836 | |
| t | 2508627 | |
| r | 2508627 | |
| e | 2508627 | |
| s | 1672418 | 7.4% |
| a | 1672418 | 7.4% |
| d | 836209 | 3.7% |
| C | 836209 | 3.7% |
| y | 836209 | 3.7% |
| v | 836209 | 3.7% |
| Other values (6) | 5017254 |
Common
| Value | Count | Frequency (%) |
| 1672418 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24250061 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 3344836 | |
| t | 2508627 | |
| r | 2508627 | |
| e | 2508627 | |
| 1672418 | 6.9% | |
| s | 1672418 | 6.9% |
| a | 1672418 | 6.9% |
| d | 836209 | 3.4% |
| C | 836209 | 3.4% |
| y | 836209 | 3.4% |
| Other values (7) | 5853463 |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 25 |
| Mean length | 25 |
| Min length | 25 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | https://ror.org/0566bfb96 |
|---|---|
| 2nd row | https://ror.org/0566bfb96 |
| 3rd row | https://ror.org/0566bfb96 |
| 4th row | https://ror.org/0566bfb96 |
| 5th row | https://ror.org/0566bfb96 |
| Value | Count | Frequency (%) |
| https://ror.org/0566bfb96 | 836209 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 2508627 | |
| r | 2508627 | |
| 6 | 2508627 | |
| t | 1672418 | 8.0% |
| o | 1672418 | 8.0% |
| b | 1672418 | 8.0% |
| h | 836209 | 4.0% |
| p | 836209 | 4.0% |
| s | 836209 | 4.0% |
| : | 836209 | 4.0% |
| Other values (6) | 5017254 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11706926 | |
| Decimal Number | 5017254 | |
| Other Punctuation | 4181045 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2508627 | |
| t | 1672418 | |
| o | 1672418 | |
| b | 1672418 | |
| h | 836209 | 7.1% |
| p | 836209 | 7.1% |
| s | 836209 | 7.1% |
| g | 836209 | 7.1% |
| f | 836209 | 7.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2508627 | |
| 0 | 836209 | 16.7% |
| 5 | 836209 | 16.7% |
| 9 | 836209 | 16.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 2508627 | |
| : | 836209 | 20.0% |
| . | 836209 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11706926 | |
| Common | 9198299 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2508627 | |
| t | 1672418 | |
| o | 1672418 | |
| b | 1672418 | |
| h | 836209 | 7.1% |
| p | 836209 | 7.1% |
| s | 836209 | 7.1% |
| g | 836209 | 7.1% |
| f | 836209 | 7.1% |
Common
| Value | Count | Frequency (%) |
| / | 2508627 | |
| 6 | 2508627 | |
| : | 836209 | 9.1% |
| . | 836209 | 9.1% |
| 0 | 836209 | 9.1% |
| 5 | 836209 | 9.1% |
| 9 | 836209 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20905225 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 2508627 | |
| r | 2508627 | |
| 6 | 2508627 | |
| t | 1672418 | 8.0% |
| o | 1672418 | 8.0% |
| b | 1672418 | 8.0% |
| h | 836209 | 4.0% |
| p | 836209 | 4.0% |
| s | 836209 | 4.0% |
| : | 836209 | 4.0% |
| Other values (6) | 5017254 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Botany |
|---|---|
| 2nd row | Botany |
| 3rd row | Botany |
| 4th row | Botany |
| 5th row | Botany |
| Value | Count | Frequency (%) |
| botany | 836209 |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 836209 | |
| o | 836209 | |
| t | 836209 | |
| a | 836209 | |
| n | 836209 | |
| y | 836209 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4181045 | |
| Uppercase Letter | 836209 | 16.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 836209 | |
| t | 836209 | |
| a | 836209 | |
| n | 836209 | |
| y | 836209 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 836209 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5017254 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 836209 | |
| o | 836209 | |
| t | 836209 | |
| a | 836209 | |
| n | 836209 | |
| y | 836209 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5017254 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 836209 | |
| o | 836209 | |
| t | 836209 | |
| a | 836209 | |
| n | 836209 | |
| y | 836209 |
basisOfRecord
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 17.99888066 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESERVED_SPECIMEN |
|---|---|
| 2nd row | PRESERVED_SPECIMEN |
| 3rd row | PRESERVED_SPECIMEN |
| 4th row | PRESERVED_SPECIMEN |
| 5th row | PRESERVED_SPECIMEN |
| Value | Count | Frequency (%) |
| preserved_specimen | 836092 | |
| occurrence | 117 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 4180694 | |
| R | 1672418 | |
| P | 1672184 | 11.1% |
| S | 1672184 | 11.1% |
| C | 836443 | 5.6% |
| N | 836209 | 5.6% |
| V | 836092 | 5.6% |
| D | 836092 | 5.6% |
| _ | 836092 | 5.6% |
| I | 836092 | 5.6% |
| Other values (3) | 836326 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 14214734 | |
| Connector Punctuation | 836092 | 5.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 4180694 | |
| R | 1672418 | |
| P | 1672184 | 11.8% |
| S | 1672184 | 11.8% |
| C | 836443 | 5.9% |
| N | 836209 | 5.9% |
| V | 836092 | 5.9% |
| D | 836092 | 5.9% |
| I | 836092 | 5.9% |
| M | 836092 | 5.9% |
| Other values (2) | 234 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 836092 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14214734 | |
| Common | 836092 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 4180694 | |
| R | 1672418 | |
| P | 1672184 | 11.8% |
| S | 1672184 | 11.8% |
| C | 836443 | 5.9% |
| N | 836209 | 5.9% |
| V | 836092 | 5.9% |
| D | 836092 | 5.9% |
| I | 836092 | 5.9% |
| M | 836092 | 5.9% |
| Other values (2) | 234 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| _ | 836092 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15050826 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 4180694 | |
| R | 1672418 | |
| P | 1672184 | 11.1% |
| S | 1672184 | 11.1% |
| C | 836443 | 5.6% |
| N | 836209 | 5.6% |
| V | 836092 | 5.6% |
| D | 836092 | 5.6% |
| _ | 836092 | 5.6% |
| I | 836092 | 5.6% |
| Other values (3) | 836326 | 5.6% |
occurrenceID
Text
Unique 
| Distinct | 836209 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
Length
| Max length | 81 |
|---|---|
| Median length | 61 |
| Mean length | 61.65443926 |
| Min length | 48 |
Unique
| Unique | 836209 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | https://data.biodiversitydata.nl/naturalis/specimen/L.2851604 |
|---|---|
| 2nd row | https://data.biodiversitydata.nl/naturalis/specimen/L%20%200971472 |
| 3rd row | https://data.biodiversitydata.nl/naturalis/specimen/L.2851644 |
| 4th row | https://data.biodiversitydata.nl/naturalis/specimen/L%20%200971531 |
| 5th row | https://data.biodiversitydata.nl/naturalis/specimen/L.2851686 |
| Value | Count | Frequency (%) |
| https://data.biodiversitydata.nl/naturalis/specimen/wag0100360 | 2 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/l.2851604 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/l.2852416 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/l%20%200972015 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/l.2852067 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/l%20%200971964 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/l.2851644 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/l%20%200971531 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/l.2851686 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/l%20%200971533 | 1 | < 0.1% |
| Other values (836198) | 836198 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5017261 | 9.7% |
| t | 5017255 | 9.7% |
| i | 4181045 | 8.1% |
| / | 4181044 | 8.1% |
| s | 3344836 | 6.5% |
| n | 2508627 | 4.9% |
| e | 2508627 | 4.9% |
| d | 2508627 | 4.9% |
| . | 2447805 | 4.7% |
| l | 1672421 | 3.2% |
| Other values (45) | 18168449 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36793237 | |
| Other Punctuation | 7572035 | 14.7% |
| Decimal Number | 6038903 | 11.7% |
| Uppercase Letter | 1151820 | 2.2% |
| Connector Punctuation | 1 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5017261 | |
| t | 5017255 | |
| i | 4181045 | |
| s | 3344836 | |
| n | 2508627 | 6.8% |
| e | 2508627 | 6.8% |
| d | 2508627 | 6.8% |
| l | 1672421 | 4.5% |
| p | 1672418 | 4.5% |
| r | 1672418 | 4.5% |
| Other values (10) | 6689702 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 582349 | |
| A | 157806 | 13.7% |
| G | 140829 | 12.2% |
| W | 140821 | 12.2% |
| U | 96045 | 8.3% |
| M | 16975 | 1.5% |
| D | 16972 | 1.5% |
| P | 4 | < 0.1% |
| N | 3 | < 0.1% |
| F | 3 | < 0.1% |
| Other values (8) | 13 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 902344 | |
| 2 | 795873 | |
| 0 | 680107 | |
| 3 | 666803 | |
| 4 | 571191 | |
| 7 | 498967 | |
| 5 | 497652 | |
| 6 | 480049 | |
| 9 | 474458 | |
| 8 | 471459 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 4181044 | |
| . | 2447805 | |
| : | 836209 | 11.0% |
| % | 106968 | 1.4% |
| ! | 9 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37945057 | |
| Common | 13610940 | 26.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5017261 | |
| t | 5017255 | |
| i | 4181045 | |
| s | 3344836 | |
| n | 2508627 | 6.6% |
| e | 2508627 | 6.6% |
| d | 2508627 | 6.6% |
| l | 1672421 | 4.4% |
| p | 1672418 | 4.4% |
| r | 1672418 | 4.4% |
| Other values (28) | 7841522 |
Common
| Value | Count | Frequency (%) |
| / | 4181044 | |
| . | 2447805 | |
| 1 | 902344 | 6.6% |
| : | 836209 | 6.1% |
| 2 | 795873 | 5.8% |
| 0 | 680107 | 5.0% |
| 3 | 666803 | 4.9% |
| 4 | 571191 | 4.2% |
| 7 | 498967 | 3.7% |
| 5 | 497652 | 3.7% |
| Other values (7) | 1532945 | 11.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51555997 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5017261 | 9.7% |
| t | 5017255 | 9.7% |
| i | 4181045 | 8.1% |
| / | 4181044 | 8.1% |
| s | 3344836 | 6.5% |
| n | 2508627 | 4.9% |
| e | 2508627 | 4.9% |
| d | 2508627 | 4.9% |
| . | 2447805 | 4.7% |
| l | 1672421 | 3.2% |
| Other values (45) | 18168449 |
catalogNumber
Text
| Distinct | 836208 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 9 |
| Mean length | 9.398614938 |
| Min length | 8 |
Unique
| Unique | 836208 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | L.2851604 |
|---|---|
| 2nd row | L 0971472 |
| 3rd row | L.2851644 |
| 4th row | L 0971531 |
| 5th row | L.2851686 |
| Value | Count | Frequency (%) |
| l | 42432 | 4.8% |
| u | 11055 | 1.2% |
| 0001135 | 2 | < 0.1% |
| 0001034 | 2 | < 0.1% |
| 0000756 | 2 | < 0.1% |
| 0000796 | 2 | < 0.1% |
| 0000857 | 2 | < 0.1% |
| 0000899 | 2 | < 0.1% |
| 0000981 | 2 | < 0.1% |
| 0001074 | 2 | < 0.1% |
| Other values (835366) | 836198 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 902344 | |
| . | 775387 | |
| 2 | 688910 | |
| 3 | 666802 | |
| L | 582349 | 7.4% |
| 0 | 573140 | 7.3% |
| 4 | 571191 | 7.3% |
| 7 | 498967 | 6.3% |
| 5 | 497652 | 6.3% |
| 6 | 480045 | 6.1% |
| Other values (35) | 1622410 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5824968 | |
| Uppercase Letter | 1151819 | 14.7% |
| Other Punctuation | 775397 | 9.9% |
| Space Separator | 106963 | 1.4% |
| Lowercase Letter | 44 | < 0.1% |
| Modifier Symbol | 4 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 582349 | |
| A | 157806 | 13.7% |
| G | 140829 | 12.2% |
| W | 140821 | 12.2% |
| U | 96045 | 8.3% |
| M | 16975 | 1.5% |
| D | 16972 | 1.5% |
| P | 4 | < 0.1% |
| I | 3 | < 0.1% |
| N | 3 | < 0.1% |
| Other values (8) | 12 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 902344 | |
| 2 | 688910 | |
| 3 | 666802 | |
| 0 | 573140 | |
| 4 | 571191 | |
| 7 | 498967 | |
| 5 | 497652 | |
| 6 | 480045 | |
| 9 | 474458 | |
| 8 | 471459 |
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 17 | |
| g | 10 | |
| a | 7 | |
| l | 3 | 6.8% |
| o | 2 | 4.5% |
| t | 1 | 2.3% |
| u | 1 | 2.3% |
| v | 1 | 2.3% |
| n | 1 | 2.3% |
| e | 1 | 2.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 775387 | |
| ! | 9 | < 0.1% |
| ? | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 106963 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 4 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6707334 | |
| Latin | 1151863 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| L | 582349 | |
| A | 157806 | 13.7% |
| G | 140829 | 12.2% |
| W | 140821 | 12.2% |
| U | 96045 | 8.3% |
| M | 16975 | 1.5% |
| D | 16972 | 1.5% |
| w | 17 | < 0.1% |
| g | 10 | < 0.1% |
| a | 7 | < 0.1% |
| Other values (18) | 32 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 1 | 902344 | |
| . | 775387 | |
| 2 | 688910 | |
| 3 | 666802 | |
| 0 | 573140 | |
| 4 | 571191 | |
| 7 | 498967 | |
| 5 | 497652 | |
| 6 | 480045 | |
| 9 | 474458 | |
| Other values (7) | 578438 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7859197 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 902344 | |
| . | 775387 | |
| 2 | 688910 | |
| 3 | 666802 | |
| L | 582349 | 7.4% |
| 0 | 573140 | 7.3% |
| 4 | 571191 | 7.3% |
| 7 | 498967 | 6.3% |
| 5 | 497652 | 6.3% |
| 6 | 480045 | 6.1% |
| Other values (35) | 1622410 |
recordNumber
Text
| Distinct | 572294 |
|---|---|
| Distinct (%) | 68.4% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 110 |
|---|---|
| Median length | 103 |
| Mean length | 21.18251701 |
| Min length | 2 |
Unique
| Unique | 542010 ? |
|---|---|
| Unique (%) | 64.8% |
Sample
| 1st row | Unknown s.n. |
|---|---|
| 2nd row | Zainoeddin bb 17357 |
| 3rd row | Wijk, JH van s.n. |
| 4th row | Unknown bb 17412 |
| 5th row | Koster, JT 6255 |
| Value | Count | Frequency (%) |
| s.n | 256450 | 7.7% |
| van | 68460 | 2.1% |
| unknown | 67287 | 2.0% |
| de | 50387 | 1.5% |
| a | 44401 | 1.3% |
| j | 43306 | 1.3% |
| m | 26511 | 0.8% |
| h | 23612 | 0.7% |
| p | 23387 | 0.7% |
| r | 23030 | 0.7% |
| Other values (92734) | 2694093 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2484713 | 14.0% | |
| n | 1099909 | 6.2% |
| e | 1017309 | 5.7% |
| , | 927788 | 5.2% |
| a | 730231 | 4.1% |
| s | 643605 | 3.6% |
| r | 584399 | 3.3% |
| o | 579526 | 3.3% |
| . | 547214 | 3.1% |
| i | 479904 | 2.7% |
| Other values (119) | 8618371 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7937101 | |
| Uppercase Letter | 3138266 | 17.7% |
| Space Separator | 2484717 | 14.0% |
| Decimal Number | 2321671 | 13.1% |
| Other Punctuation | 1748190 | 9.9% |
| Dash Punctuation | 63539 | 0.4% |
| Open Punctuation | 9522 | 0.1% |
| Close Punctuation | 9517 | 0.1% |
| Connector Punctuation | 352 | < 0.1% |
| Math Symbol | 52 | < 0.1% |
| Other values (3) | 42 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1099909 | |
| e | 1017309 | |
| a | 730231 | |
| s | 643605 | 8.1% |
| r | 584399 | 7.4% |
| o | 579526 | 7.3% |
| i | 479904 | 6.0% |
| l | 369214 | 4.7% |
| t | 342007 | 4.3% |
| d | 276655 | 3.5% |
| Other values (44) | 1814342 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 322743 | 10.3% |
| H | 219232 | 7.0% |
| A | 214963 | 6.8% |
| S | 214864 | 6.8% |
| B | 205943 | 6.6% |
| M | 195533 | 6.2% |
| C | 166837 | 5.3% |
| W | 157895 | 5.0% |
| P | 154576 | 4.9% |
| R | 139022 | 4.4% |
| Other values (27) | 1146658 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 927788 | |
| . | 547214 | |
| ; | 258390 | 14.8% |
| / | 8084 | 0.5% |
| ' | 5032 | 0.3% |
| ! | 1088 | 0.1% |
| : | 366 | < 0.1% |
| ? | 114 | < 0.1% |
| \ | 42 | < 0.1% |
| * | 34 | < 0.1% |
| Other values (3) | 38 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 364437 | |
| 2 | 275734 | |
| 3 | 245370 | |
| 4 | 225943 | |
| 5 | 216525 | |
| 6 | 208526 | |
| 7 | 201503 | |
| 0 | 195678 | |
| 8 | 194815 | |
| 9 | 193140 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 9242 | |
| [ | 279 | 2.9% |
| { | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2484713 | ||
| 4 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 9239 | |
| ] | 278 | 2.9% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 46 | |
| = | 6 | 11.5% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 15 | |
| ¼ | 2 | 11.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 63539 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 352 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 22 |
Other Letter
| Value | Count | Frequency (%) |
| ª | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11075370 | |
| Common | 6637599 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 1099909 | 9.9% |
| e | 1017309 | 9.2% |
| a | 730231 | 6.6% |
| s | 643605 | 5.8% |
| r | 584399 | 5.3% |
| o | 579526 | 5.2% |
| i | 479904 | 4.3% |
| l | 369214 | 3.3% |
| t | 342007 | 3.1% |
| J | 322743 | 2.9% |
| Other values (82) | 4906523 |
Common
| Value | Count | Frequency (%) |
| 2484713 | ||
| , | 927788 | 14.0% |
| . | 547214 | 8.2% |
| 1 | 364437 | 5.5% |
| 2 | 275734 | 4.2% |
| ; | 258390 | 3.9% |
| 3 | 245370 | 3.7% |
| 4 | 225943 | 3.4% |
| 5 | 216525 | 3.3% |
| 6 | 208526 | 3.1% |
| Other values (27) | 882959 | 13.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17673878 | |
| None | 39091 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2484713 | 14.1% | |
| n | 1099909 | 6.2% |
| e | 1017309 | 5.8% |
| , | 927788 | 5.2% |
| a | 730231 | 4.1% |
| s | 643605 | 3.6% |
| r | 584399 | 3.3% |
| o | 579526 | 3.3% |
| . | 547214 | 3.1% |
| i | 479904 | 2.7% |
| Other values (75) | 8579280 |
None
| Value | Count | Frequency (%) |
| é | 10707 | |
| ü | 7216 | |
| ö | 3633 | 9.3% |
| á | 3193 | 8.2% |
| è | 2754 | 7.0% |
| í | 1869 | 4.8% |
| ñ | 1745 | 4.5% |
| ß | 1460 | 3.7% |
| ó | 1358 | 3.5% |
| ë | 879 | 2.2% |
| Other values (34) | 4277 | 10.9% |
recordedBy
Text
| Distinct | 48505 |
|---|---|
| Distinct (%) | 5.8% |
| Missing | 1735 |
| Missing (%) | 0.2% |
| Memory size | 6.4 MiB |
Length
| Max length | 98 |
|---|---|
| Median length | 94 |
| Mean length | 14.5336799 |
| Min length | 1 |
Unique
| Unique | 21906 ? |
|---|---|
| Unique (%) | 2.6% |
Sample
| 1st row | Unknown |
|---|---|
| 2nd row | Zainoeddin |
| 3rd row | Wijk JH van |
| 4th row | Unknown |
| 5th row | Koster JT |
| Value | Count | Frequency (%) |
| van | 68460 | 2.9% |
| unknown | 67287 | 2.9% |
| de | 50384 | 2.2% |
| j | 43131 | 1.8% |
| a | 35050 | 1.5% |
| m | 25600 | 1.1% |
| h | 23002 | 1.0% |
| r | 22773 | 1.0% |
| al | 22732 | 1.0% |
| p | 22508 | 1.0% |
| Other values (25712) | 1960338 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1506829 | 12.4% | |
| e | 1015405 | 8.4% |
| n | 838456 | 6.9% |
| a | 723122 | 6.0% |
| r | 581380 | 4.8% |
| o | 577768 | 4.8% |
| i | 477363 | 3.9% |
| s | 386244 | 3.2% |
| l | 367602 | 3.0% |
| t | 341134 | 2.8% |
| Other values (106) | 5312675 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7372185 | |
| Uppercase Letter | 2900716 | 23.9% |
| Space Separator | 1506833 | 12.4% |
| Other Punctuation | 289450 | 2.4% |
| Dash Punctuation | 42507 | 0.4% |
| Decimal Number | 7800 | 0.1% |
| Open Punctuation | 4069 | < 0.1% |
| Close Punctuation | 4067 | < 0.1% |
| Connector Punctuation | 331 | < 0.1% |
| Math Symbol | 20 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1015405 | |
| n | 838456 | |
| a | 723122 | |
| r | 581380 | 7.9% |
| o | 577768 | 7.8% |
| i | 477363 | 6.5% |
| s | 386244 | 5.2% |
| l | 367602 | 5.0% |
| t | 341134 | 4.6% |
| d | 272450 | 3.7% |
| Other values (43) | 1791261 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 322054 | 11.1% |
| H | 212297 | 7.3% |
| A | 192463 | 6.6% |
| M | 192264 | 6.6% |
| S | 185383 | 6.4% |
| B | 184355 | 6.4% |
| C | 162164 | 5.6% |
| W | 152179 | 5.2% |
| R | 130864 | 4.5% |
| P | 129745 | 4.5% |
| Other values (27) | 1036948 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1670 | |
| 9 | 1504 | |
| 6 | 974 | |
| 7 | 883 | |
| 4 | 725 | |
| 8 | 645 | 8.3% |
| 0 | 378 | 4.8% |
| 5 | 348 | 4.5% |
| 2 | 339 | 4.3% |
| 3 | 334 | 4.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 258389 | |
| . | 25959 | 9.0% |
| ' | 4957 | 1.7% |
| ? | 70 | < 0.1% |
| / | 45 | < 0.1% |
| & | 20 | < 0.1% |
| ! | 6 | < 0.1% |
| ¡ | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1506829 | ||
| 4 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 16 | |
| = | 4 | 20.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 42507 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4069 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4067 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 331 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10272901 | |
| Common | 1855077 | 15.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1015405 | 9.9% |
| n | 838456 | 8.2% |
| a | 723122 | 7.0% |
| r | 581380 | 5.7% |
| o | 577768 | 5.6% |
| i | 477363 | 4.6% |
| s | 386244 | 3.8% |
| l | 367602 | 3.6% |
| t | 341134 | 3.3% |
| J | 322054 | 3.1% |
| Other values (80) | 4642373 |
Common
| Value | Count | Frequency (%) |
| 1506829 | ||
| ; | 258389 | 13.9% |
| - | 42507 | 2.3% |
| . | 25959 | 1.4% |
| ' | 4957 | 0.3% |
| ( | 4069 | 0.2% |
| ) | 4067 | 0.2% |
| 1 | 1670 | 0.1% |
| 9 | 1504 | 0.1% |
| 6 | 974 | 0.1% |
| Other values (16) | 4152 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12090377 | |
| None | 37601 | 0.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1506829 | 12.5% | |
| e | 1015405 | 8.4% |
| n | 838456 | 6.9% |
| a | 723122 | 6.0% |
| r | 581380 | 4.8% |
| o | 577768 | 4.8% |
| i | 477363 | 3.9% |
| s | 386244 | 3.2% |
| l | 367602 | 3.0% |
| t | 341134 | 2.8% |
| Other values (66) | 5275074 |
None
| Value | Count | Frequency (%) |
| é | 10707 | |
| ü | 7216 | |
| ö | 3633 | 9.7% |
| á | 3193 | 8.5% |
| è | 2754 | 7.3% |
| í | 1869 | 5.0% |
| ñ | 1745 | 4.6% |
| ó | 1358 | 3.6% |
| ë | 879 | 2.3% |
| ä | 815 | 2.2% |
| Other values (30) | 3432 | 9.1% |
occurrenceStatus
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESENT |
|---|---|
| 2nd row | PRESENT |
| 3rd row | PRESENT |
| 4th row | PRESENT |
| 5th row | PRESENT |
| Value | Count | Frequency (%) |
| present | 836208 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1672416 | |
| P | 836208 | |
| R | 836208 | |
| S | 836208 | |
| N | 836208 | |
| T | 836208 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5853456 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1672416 | |
| P | 836208 | |
| R | 836208 | |
| S | 836208 | |
| N | 836208 | |
| T | 836208 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5853456 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1672416 | |
| P | 836208 | |
| R | 836208 | |
| S | 836208 | |
| N | 836208 | |
| T | 836208 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5853456 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1672416 | |
| P | 836208 | |
| R | 836208 | |
| S | 836208 | |
| N | 836208 | |
| T | 836208 |
Missing 
| Distinct | 208546 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 626711 |
| Missing (%) | 74.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 26 |
|---|---|
| Median length | 10 |
| Mean length | 10.80295755 |
| Min length | 1 |
Unique
| Unique | 207683 ? |
|---|---|
| Unique (%) | 99.1% |
Sample
| 1st row | L 0215467 |
|---|---|
| 2nd row | L 0215532 |
| 3rd row | L 0204325 |
| 4th row | L 0542724 |
| 5th row | L 0973113 |
| Value | Count | Frequency (%) |
| l | 106173 | |
| u | 21496 | 6.3% |
| b | 684 | 0.2% |
| uw | 595 | 0.2% |
| a | 425 | 0.1% |
| fhow | 27 | < 0.1% |
| unw | 22 | < 0.1% |
| madw | 20 | < 0.1% |
| bw | 19 | < 0.1% |
| 0 | 18 | < 0.1% |
| Other values (208545) | 210806 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 343786 | |
| 258456 | ||
| 1 | 177209 | 7.8% |
| 2 | 165005 | 7.3% |
| 3 | 153082 | 6.8% |
| 9 | 149145 | 6.6% |
| 4 | 140099 | 6.2% |
| 5 | 138989 | 6.1% |
| 8 | 135727 | 6.0% |
| 6 | 131580 | 5.8% |
| Other values (61) | 470120 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1656769 | |
| Uppercase Letter | 300499 | 13.3% |
| Space Separator | 258456 | 11.4% |
| Other Punctuation | 31261 | 1.4% |
| Lowercase Letter | 16106 | 0.7% |
| Dash Punctuation | 100 | < 0.1% |
| Modifier Symbol | 6 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 106259 | |
| A | 52483 | |
| W | 50540 | |
| G | 50437 | |
| U | 29934 | 10.0% |
| D | 2506 | 0.8% |
| M | 1441 | 0.5% |
| F | 1003 | 0.3% |
| O | 898 | 0.3% |
| B | 884 | 0.3% |
| Other values (16) | 4114 | 1.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 11100 | |
| u | 530 | 3.3% |
| e | 496 | 3.1% |
| i | 445 | 2.8% |
| a | 402 | 2.5% |
| n | 376 | 2.3% |
| j | 348 | 2.2% |
| p | 291 | 1.8% |
| t | 281 | 1.7% |
| l | 263 | 1.6% |
| Other values (14) | 1574 | 9.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 343786 | |
| 1 | 177209 | |
| 2 | 165005 | |
| 3 | 153082 | |
| 9 | 149145 | |
| 4 | 140099 | |
| 5 | 138989 | |
| 8 | 135727 | 8.2% |
| 6 | 131580 | 7.9% |
| 7 | 122147 | 7.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 30046 | |
| . | 1078 | 3.4% |
| : | 105 | 0.3% |
| / | 29 | 0.1% |
| ? | 1 | < 0.1% |
| ' | 1 | < 0.1% |
| ! | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 258456 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 100 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 6 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1946593 | |
| Latin | 316605 | 14.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| L | 106259 | |
| A | 52483 | |
| W | 50540 | |
| G | 50437 | |
| U | 29934 | 9.5% |
| w | 11100 | 3.5% |
| D | 2506 | 0.8% |
| M | 1441 | 0.5% |
| F | 1003 | 0.3% |
| O | 898 | 0.3% |
| Other values (40) | 10004 | 3.2% |
Common
| Value | Count | Frequency (%) |
| 0 | 343786 | |
| 258456 | ||
| 1 | 177209 | |
| 2 | 165005 | |
| 3 | 153082 | |
| 9 | 149145 | |
| 4 | 140099 | |
| 5 | 138989 | |
| 8 | 135727 | 7.0% |
| 6 | 131580 | 6.8% |
| Other values (11) | 153515 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2263198 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 343786 | |
| 258456 | ||
| 1 | 177209 | 7.8% |
| 2 | 165005 | 7.3% |
| 3 | 153082 | 6.8% |
| 9 | 149145 | 6.6% |
| 4 | 140099 | 6.2% |
| 5 | 138989 | 6.1% |
| 8 | 135727 | 6.0% |
| 6 | 131580 | 5.8% |
| Other values (61) | 470120 |
eventDate
Text
Missing 
| Distinct | 55581 |
|---|---|
| Distinct (%) | 8.0% |
| Missing | 143292 |
| Missing (%) | 17.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 11.82515511 |
| Min length | 10 |
Unique
| Unique | 8468 ? |
|---|---|
| Unique (%) | 1.2% |
Sample
| 1st row | 1933-04-24 |
|---|---|
| 2nd row | 1956-05-14 |
| 3rd row | 1939-05-21 |
| 4th row | 1955-04-26 |
| 5th row | 1838-05-01/1838-05-31 |
| Value | Count | Frequency (%) |
| 1859-01-01/1859-12-31 | 870 | 0.1% |
| 1857-01-01/1857-12-31 | 606 | 0.1% |
| 1898-01-01/1898-12-31 | 535 | 0.1% |
| 1922-10-01/1922-10-31 | 490 | 0.1% |
| 1912-01-01/1912-12-31 | 463 | 0.1% |
| 1900-01-01/1900-12-31 | 443 | 0.1% |
| 1840-01-01/1840-12-31 | 438 | 0.1% |
| 1909-01-01/1909-12-31 | 438 | 0.1% |
| 1893-01-01/1893-12-31 | 434 | 0.1% |
| 1880-01-01/1880-12-31 | 425 | 0.1% |
| Other values (55571) | 687775 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1643592 | |
| - | 1615776 | |
| 0 | 1247462 | |
| 9 | 943290 | |
| 2 | 517481 | 6.3% |
| 8 | 437504 | 5.3% |
| 3 | 388459 | 4.7% |
| 6 | 361794 | 4.4% |
| 7 | 360295 | 4.4% |
| 5 | 319026 | 3.9% |
| Other values (2) | 359172 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6463104 | |
| Dash Punctuation | 1615776 | 19.7% |
| Other Punctuation | 114971 | 1.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1643592 | |
| 0 | 1247462 | |
| 9 | 943290 | |
| 2 | 517481 | 8.0% |
| 8 | 437504 | 6.8% |
| 3 | 388459 | 6.0% |
| 6 | 361794 | 5.6% |
| 7 | 360295 | 5.6% |
| 5 | 319026 | 4.9% |
| 4 | 244201 | 3.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1615776 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 114971 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8193851 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1643592 | |
| - | 1615776 | |
| 0 | 1247462 | |
| 9 | 943290 | |
| 2 | 517481 | 6.3% |
| 8 | 437504 | 5.3% |
| 3 | 388459 | 4.7% |
| 6 | 361794 | 4.4% |
| 7 | 360295 | 4.4% |
| 5 | 319026 | 3.9% |
| Other values (2) | 359172 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8193851 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1643592 | |
| - | 1615776 | |
| 0 | 1247462 | |
| 9 | 943290 | |
| 2 | 517481 | 6.3% |
| 8 | 437504 | 5.3% |
| 3 | 388459 | 4.7% |
| 6 | 361794 | 4.4% |
| 7 | 360295 | 4.4% |
| 5 | 319026 | 3.9% |
| Other values (2) | 359172 | 4.4% |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 143292 |
| Missing (%) | 17.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.711362256 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 114 |
|---|---|
| 2nd row | 135 |
| 3rd row | 141 |
| 4th row | 116 |
| 5th row | 121 |
| Value | Count | Frequency (%) |
| 1 | 36360 | 5.2% |
| 182 | 13626 | 2.0% |
| 213 | 11511 | 1.7% |
| 152 | 10863 | 1.6% |
| 121 | 8743 | 1.3% |
| 244 | 6422 | 0.9% |
| 183 | 5707 | 0.8% |
| 274 | 5575 | 0.8% |
| 91 | 5567 | 0.8% |
| 214 | 5131 | 0.7% |
| Other values (356) | 583412 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 425475 | |
| 2 | 367264 | |
| 3 | 221411 | |
| 4 | 136428 | 7.3% |
| 5 | 135758 | 7.2% |
| 8 | 123402 | 6.6% |
| 0 | 120308 | 6.4% |
| 6 | 118857 | 6.3% |
| 9 | 117427 | 6.3% |
| 7 | 112419 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1878749 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 425475 | |
| 2 | 367264 | |
| 3 | 221411 | |
| 4 | 136428 | 7.3% |
| 5 | 135758 | 7.2% |
| 8 | 123402 | 6.6% |
| 0 | 120308 | 6.4% |
| 6 | 118857 | 6.3% |
| 9 | 117427 | 6.3% |
| 7 | 112419 | 6.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1878749 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 425475 | |
| 2 | 367264 | |
| 3 | 221411 | |
| 4 | 136428 | 7.3% |
| 5 | 135758 | 7.2% |
| 8 | 123402 | 6.6% |
| 0 | 120308 | 6.4% |
| 6 | 118857 | 6.3% |
| 9 | 117427 | 6.3% |
| 7 | 112419 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1878749 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 425475 | |
| 2 | 367264 | |
| 3 | 221411 | |
| 4 | 136428 | 7.3% |
| 5 | 135758 | 7.2% |
| 8 | 123402 | 6.6% |
| 0 | 120308 | 6.4% |
| 6 | 118857 | 6.3% |
| 9 | 117427 | 6.3% |
| 7 | 112419 | 6.0% |
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 143292 |
| Missing (%) | 17.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.819598884 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 114 |
|---|---|
| 2nd row | 135 |
| 3rd row | 141 |
| 4th row | 116 |
| 5th row | 151 |
| Value | Count | Frequency (%) |
| 365 | 27667 | 4.0% |
| 212 | 13371 | 1.9% |
| 243 | 10946 | 1.6% |
| 181 | 10761 | 1.6% |
| 151 | 9144 | 1.3% |
| 366 | 8805 | 1.3% |
| 273 | 6217 | 0.9% |
| 120 | 6124 | 0.9% |
| 213 | 5878 | 0.8% |
| 304 | 5369 | 0.8% |
| Other values (356) | 588635 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 391770 | |
| 2 | 363048 | |
| 3 | 263424 | |
| 6 | 158539 | |
| 5 | 156753 | |
| 4 | 142445 | 7.3% |
| 0 | 126544 | 6.5% |
| 8 | 120011 | 6.1% |
| 9 | 117735 | 6.0% |
| 7 | 113479 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1953748 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 391770 | |
| 2 | 363048 | |
| 3 | 263424 | |
| 6 | 158539 | |
| 5 | 156753 | |
| 4 | 142445 | 7.3% |
| 0 | 126544 | 6.5% |
| 8 | 120011 | 6.1% |
| 9 | 117735 | 6.0% |
| 7 | 113479 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1953748 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 391770 | |
| 2 | 363048 | |
| 3 | 263424 | |
| 6 | 158539 | |
| 5 | 156753 | |
| 4 | 142445 | 7.3% |
| 0 | 126544 | 6.5% |
| 8 | 120011 | 6.1% |
| 9 | 117735 | 6.0% |
| 7 | 113479 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1953748 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 391770 | |
| 2 | 363048 | |
| 3 | 263424 | |
| 6 | 158539 | |
| 5 | 156753 | |
| 4 | 142445 | 7.3% |
| 0 | 126544 | 6.5% |
| 8 | 120011 | 6.1% |
| 9 | 117735 | 6.0% |
| 7 | 113479 | 5.8% |
year
Text
Missing 
| Distinct | 286 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 143292 |
| Missing (%) | 17.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 24 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1933 |
|---|---|
| 2nd row | 1956 |
| 3rd row | 1939 |
| 4th row | 1955 |
| 5th row | 1838 |
| Value | Count | Frequency (%) |
| 1969 | 12887 | 1.9% |
| 1968 | 12780 | 1.8% |
| 1966 | 12388 | 1.8% |
| 1965 | 11988 | 1.7% |
| 1967 | 11903 | 1.7% |
| 1974 | 11118 | 1.6% |
| 1964 | 11038 | 1.6% |
| 1961 | 11015 | 1.6% |
| 1972 | 10962 | 1.6% |
| 1963 | 10940 | 1.6% |
| Other values (276) | 575898 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 762334 | |
| 9 | 729495 | |
| 8 | 225933 | 8.2% |
| 6 | 191185 | 6.9% |
| 7 | 173174 | 6.2% |
| 0 | 161288 | 5.8% |
| 5 | 156166 | 5.6% |
| 2 | 140469 | 5.1% |
| 3 | 123288 | 4.4% |
| 4 | 108336 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2771668 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 762334 | |
| 9 | 729495 | |
| 8 | 225933 | 8.2% |
| 6 | 191185 | 6.9% |
| 7 | 173174 | 6.2% |
| 0 | 161288 | 5.8% |
| 5 | 156166 | 5.6% |
| 2 | 140469 | 5.1% |
| 3 | 123288 | 4.4% |
| 4 | 108336 | 3.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2771668 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 762334 | |
| 9 | 729495 | |
| 8 | 225933 | 8.2% |
| 6 | 191185 | 6.9% |
| 7 | 173174 | 6.2% |
| 0 | 161288 | 5.8% |
| 5 | 156166 | 5.6% |
| 2 | 140469 | 5.1% |
| 3 | 123288 | 4.4% |
| 4 | 108336 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2771668 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 762334 | |
| 9 | 729495 | |
| 8 | 225933 | 8.2% |
| 6 | 191185 | 6.9% |
| 7 | 173174 | 6.2% |
| 0 | 161288 | 5.8% |
| 5 | 156166 | 5.6% |
| 2 | 140469 | 5.1% |
| 3 | 123288 | 4.4% |
| 4 | 108336 | 3.9% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 176649 |
| Missing (%) | 21.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.186186245 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 5 |
| 3rd row | 5 |
| 4th row | 4 |
| 5th row | 5 |
| Value | Count | Frequency (%) |
| 7 | 93480 | |
| 6 | 79346 | |
| 8 | 76084 | |
| 5 | 71612 | |
| 9 | 56523 | |
| 4 | 53174 | |
| 10 | 49536 | |
| 11 | 42390 | |
| 3 | 42087 | |
| 2 | 33109 | 5.0% |
| Other values (2) | 62219 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 196535 | |
| 7 | 93480 | |
| 6 | 79346 | |
| 8 | 76084 | 9.7% |
| 5 | 71612 | 9.2% |
| 2 | 63984 | 8.2% |
| 9 | 56523 | 7.2% |
| 4 | 53174 | 6.8% |
| 0 | 49536 | 6.3% |
| 3 | 42087 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 782361 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 196535 | |
| 7 | 93480 | |
| 6 | 79346 | |
| 8 | 76084 | 9.7% |
| 5 | 71612 | 9.2% |
| 2 | 63984 | 8.2% |
| 9 | 56523 | 7.2% |
| 4 | 53174 | 6.8% |
| 0 | 49536 | 6.3% |
| 3 | 42087 | 5.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 782361 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 196535 | |
| 7 | 93480 | |
| 6 | 79346 | |
| 8 | 76084 | 9.7% |
| 5 | 71612 | 9.2% |
| 2 | 63984 | 8.2% |
| 9 | 56523 | 7.2% |
| 4 | 53174 | 6.8% |
| 0 | 49536 | 6.3% |
| 3 | 42087 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 782361 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 196535 | |
| 7 | 93480 | |
| 6 | 79346 | |
| 8 | 76084 | 9.7% |
| 5 | 71612 | 9.2% |
| 2 | 63984 | 8.2% |
| 9 | 56523 | 7.2% |
| 4 | 53174 | 6.8% |
| 0 | 49536 | 6.3% |
| 3 | 42087 | 5.4% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 258263 |
| Missing (%) | 30.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.716617123 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 24 |
|---|---|
| 2nd row | 14 |
| 3rd row | 21 |
| 4th row | 26 |
| 5th row | 10 |
| Value | Count | Frequency (%) |
| 20 | 21100 | 3.7% |
| 10 | 20702 | 3.6% |
| 15 | 20563 | 3.6% |
| 12 | 20011 | 3.5% |
| 18 | 19984 | 3.5% |
| 25 | 19720 | 3.4% |
| 22 | 19601 | 3.4% |
| 23 | 19480 | 3.4% |
| 17 | 19411 | 3.4% |
| 14 | 19308 | 3.3% |
| Other values (21) | 378066 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 261762 | |
| 2 | 248712 | |
| 3 | 82722 | 8.3% |
| 5 | 58743 | 5.9% |
| 0 | 58587 | 5.9% |
| 8 | 57340 | 5.8% |
| 7 | 56705 | 5.7% |
| 6 | 56290 | 5.7% |
| 4 | 56252 | 5.7% |
| 9 | 54999 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 992112 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 261762 | |
| 2 | 248712 | |
| 3 | 82722 | 8.3% |
| 5 | 58743 | 5.9% |
| 0 | 58587 | 5.9% |
| 8 | 57340 | 5.8% |
| 7 | 56705 | 5.7% |
| 6 | 56290 | 5.7% |
| 4 | 56252 | 5.7% |
| 9 | 54999 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 992112 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 261762 | |
| 2 | 248712 | |
| 3 | 82722 | 8.3% |
| 5 | 58743 | 5.9% |
| 0 | 58587 | 5.9% |
| 8 | 57340 | 5.8% |
| 7 | 56705 | 5.7% |
| 6 | 56290 | 5.7% |
| 4 | 56252 | 5.7% |
| 9 | 54999 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 992112 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 261762 | |
| 2 | 248712 | |
| 3 | 82722 | 8.3% |
| 5 | 58743 | 5.9% |
| 0 | 58587 | 5.9% |
| 8 | 57340 | 5.8% |
| 7 | 56705 | 5.7% |
| 6 | 56290 | 5.7% |
| 4 | 56252 | 5.7% |
| 9 | 54999 | 5.5% |
habitat
Text
Missing 
| Distinct | 85802 |
|---|---|
| Distinct (%) | 57.4% |
| Missing | 686738 |
| Missing (%) | 82.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 38739 |
|---|---|
| Median length | 444 |
| Mean length | 40.11896622 |
| Min length | 1 |
Unique
| Unique | 70802 ? |
|---|---|
| Unique (%) | 47.4% |
Sample
| 1st row | Old forest |
|---|---|
| 2nd row | Old forest Very scanty |
| 3rd row | Old forest, steep ridge |
| 4th row | Old forest, clayey soil, sloping country, scanty |
| 5th row | Degrade forest |
| Value | Count | Frequency (%) |
| forest | 69458 | 7.8% |
| in | 32271 | 3.6% |
| on | 27752 | 3.1% |
| of | 15232 | 1.7% |
| soil | 14658 | 1.6% |
| primary | 13760 | 1.5% |
| with | 12081 | 1.4% |
| secondary | 11807 | 1.3% |
| the | 11193 | 1.3% |
| along | 10578 | 1.2% |
| Other values (37367) | 672044 |
Most occurring characters
| Value | Count | Frequency (%) |
| 740532 | 12.3% | |
| e | 583099 | 9.7% |
| r | 430338 | 7.2% |
| a | 415456 | 6.9% |
| o | 409182 | 6.8% |
| n | 340698 | 5.7% |
| s | 329561 | 5.5% |
| t | 306586 | 5.1% |
| i | 299738 | 5.0% |
| l | 234073 | 3.9% |
| Other values (143) | 1907359 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4733336 | |
| Space Separator | 740532 | 12.3% |
| Other Punctuation | 242637 | 4.0% |
| Uppercase Letter | 221438 | 3.7% |
| Decimal Number | 25398 | 0.4% |
| Dash Punctuation | 13197 | 0.2% |
| Control | 8046 | 0.1% |
| Open Punctuation | 4942 | 0.1% |
| Close Punctuation | 4924 | 0.1% |
| Math Symbol | 1828 | < 0.1% |
| Other values (8) | 344 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 583099 | |
| r | 430338 | 9.1% |
| a | 415456 | 8.8% |
| o | 409182 | 8.6% |
| n | 340698 | 7.2% |
| s | 329561 | 7.0% |
| t | 306586 | 6.5% |
| i | 299738 | 6.3% |
| l | 234073 | 4.9% |
| d | 233984 | 4.9% |
| Other values (46) | 1150621 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 28522 | |
| O | 21270 | 9.6% |
| P | 17531 | 7.9% |
| I | 13516 | 6.1% |
| F | 13391 | 6.0% |
| R | 12633 | 5.7% |
| A | 12186 | 5.5% |
| D | 11776 | 5.3% |
| C | 11713 | 5.3% |
| M | 11182 | 5.0% |
| Other values (29) | 67718 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 154594 | |
| , | 66676 | |
| ; | 13156 | 5.4% |
| ' | 2774 | 1.1% |
| / | 2038 | 0.8% |
| : | 1553 | 0.6% |
| & | 617 | 0.3% |
| ? | 553 | 0.2% |
| " | 368 | 0.2% |
| % | 178 | 0.1% |
| Other values (7) | 130 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6268 | |
| 1 | 3980 | |
| 2 | 3363 | |
| 5 | 3066 | |
| 3 | 2053 | 8.1% |
| 4 | 1898 | 7.5% |
| 6 | 1305 | 5.1% |
| 9 | 1273 | 5.0% |
| 7 | 1102 | 4.3% |
| 8 | 1090 | 4.3% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1433 | |
| ± | 152 | 8.3% |
| | | 88 | 4.8% |
| = | 70 | 3.8% |
| > | 40 | 2.2% |
| < | 36 | 2.0% |
| ~ | 8 | 0.4% |
| × | 1 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4598 | |
| [ | 341 | 6.9% |
| ‚ | 2 | < 0.1% |
| { | 1 | < 0.1% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 25 | |
| ² | 16 | |
| ¼ | 1 | 2.4% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 7 | |
| ^ | 2 | 20.0% |
| ´ | 1 | 10.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13183 | |
| – | 14 | 0.1% |
Control
| Value | Count | Frequency (%) |
| 8010 | ||
| 36 | 0.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4589 | |
| ] | 335 | 6.8% |
Space Separator
| Value | Count | Frequency (%) |
| 740532 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 186 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 85 |
Other Letter
| Value | Count | Frequency (%) |
| º | 10 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 5 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 5 |
Currency Symbol
| Value | Count | Frequency (%) |
| £ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4954782 | |
| Common | 1041840 | 17.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 583099 | |
| r | 430338 | 8.7% |
| a | 415456 | 8.4% |
| o | 409182 | 8.3% |
| n | 340698 | 6.9% |
| s | 329561 | 6.7% |
| t | 306586 | 6.2% |
| i | 299738 | 6.0% |
| l | 234073 | 4.7% |
| d | 233984 | 4.7% |
| Other values (85) | 1372067 |
Common
| Value | Count | Frequency (%) |
| 740532 | ||
| . | 154594 | 14.8% |
| , | 66676 | 6.4% |
| - | 13183 | 1.3% |
| ; | 13156 | 1.3% |
| 8010 | 0.8% | |
| 0 | 6268 | 0.6% |
| ( | 4598 | 0.4% |
| ) | 4589 | 0.4% |
| 1 | 3980 | 0.4% |
| Other values (48) | 26254 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5980939 | |
| None | 15655 | 0.3% |
| Punctuation | 28 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 740532 | 12.4% | |
| e | 583099 | 9.7% |
| r | 430338 | 7.2% |
| a | 415456 | 6.9% |
| o | 409182 | 6.8% |
| n | 340698 | 5.7% |
| s | 329561 | 5.5% |
| t | 306586 | 5.1% |
| i | 299738 | 5.0% |
| l | 234073 | 3.9% |
| Other values (84) | 1891676 |
None
| Value | Count | Frequency (%) |
| é | 4841 | |
| ê | 4190 | |
| è | 2304 | |
| à | 1106 | 7.1% |
| á | 552 | 3.5% |
| ä | 402 | 2.6% |
| ü | 262 | 1.7% |
| í | 240 | 1.5% |
| ú | 191 | 1.2% |
| ó | 183 | 1.2% |
| Other values (44) | 1384 | 8.8% |
Punctuation
| Value | Count | Frequency (%) |
| – | 14 | |
| ” | 5 | 17.9% |
| “ | 5 | 17.9% |
| ‚ | 2 | 7.1% |
| † | 2 | 7.1% |
sampleSizeValue
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0.0 m |
|---|
| Value | Count | Frequency (%) |
| 0.0 | 1 | |
| m | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 1 | |
| 1 | ||
| m | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2 | |
| Other Punctuation | 1 | |
| Space Separator | 1 | |
| Lowercase Letter | 1 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4 | |
| Latin | 1 | 20.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 1 | |
| 1 |
Latin
| Value | Count | Frequency (%) |
| m | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 1 | |
| 1 | ||
| m | 1 |
higherGeography
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 51.41942 |
|---|
| Value | Count | Frequency (%) |
| 51.41942 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 4 | 2 | |
| 5 | 1 | |
| . | 1 | |
| 9 | 1 | |
| 2 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 | |
| Other Punctuation | 1 | 12.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 4 | 2 | |
| 5 | 1 | |
| 9 | 1 | |
| 2 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 4 | 2 | |
| 5 | 1 | |
| . | 1 | |
| 9 | 1 | |
| 2 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 4 | 2 | |
| 5 | 1 | |
| . | 1 | |
| 9 | 1 | |
| 2 | 1 |
continent
Text
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 150752 |
| Missing (%) | 18.0% |
| Memory size | 6.4 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 6.548962225 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | EUROPE |
|---|---|
| 2nd row | ASIA |
| 3rd row | EUROPE |
| 4th row | ASIA |
| 5th row | EUROPE |
| Value | Count | Frequency (%) |
| asia | 209000 | |
| europe | 197368 | |
| africa | 113320 | |
| south_america | 65750 | 9.6% |
| oceania | 60888 | 8.9% |
| north_america | 38877 | 5.7% |
| antarctica | 253 | < 0.1% |
| 3.69787 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 976429 | |
| E | 560251 | |
| I | 488088 | |
| R | 454445 | |
| O | 362883 | 8.1% |
| C | 279341 | 6.2% |
| S | 274750 | 6.1% |
| U | 263118 | 5.9% |
| P | 197368 | 4.4% |
| F | 113320 | 2.5% |
| Other values (11) | 519039 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4384398 | |
| Connector Punctuation | 104627 | 2.3% |
| Decimal Number | 6 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 976429 | |
| E | 560251 | |
| I | 488088 | |
| R | 454445 | |
| O | 362883 | 8.3% |
| C | 279341 | 6.4% |
| S | 274750 | 6.3% |
| U | 263118 | 6.0% |
| P | 197368 | 4.5% |
| F | 113320 | 2.6% |
| Other values (4) | 414405 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 2 | |
| 3 | 1 | |
| 6 | 1 | |
| 9 | 1 | |
| 8 | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 104627 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4384398 | |
| Common | 104634 | 2.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 976429 | |
| E | 560251 | |
| I | 488088 | |
| R | 454445 | |
| O | 362883 | 8.3% |
| C | 279341 | 6.4% |
| S | 274750 | 6.3% |
| U | 263118 | 6.0% |
| P | 197368 | 4.5% |
| F | 113320 | 2.6% |
| Other values (4) | 414405 |
Common
| Value | Count | Frequency (%) |
| _ | 104627 | |
| 7 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
| . | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4489032 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 976429 | |
| E | 560251 | |
| I | 488088 | |
| R | 454445 | |
| O | 362883 | 8.1% |
| C | 279341 | 6.2% |
| S | 274750 | 6.1% |
| U | 263118 | 5.9% |
| P | 197368 | 4.4% |
| F | 113320 | 2.5% |
| Other values (11) | 519039 |
countryCode
Text
| Distinct | 233 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3342 |
| Missing (%) | 0.4% |
| Memory size | 6.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | FR |
|---|---|
| 2nd row | ID |
| 3rd row | FR |
| 4th row | ID |
| 5th row | GR |
| Value | Count | Frequency (%) |
| zz | 149452 | |
| nl | 119066 | 14.3% |
| id | 96368 | 11.6% |
| my | 37231 | 4.5% |
| pg | 26154 | 3.1% |
| br | 18998 | 2.3% |
| fr | 18916 | 2.3% |
| us | 18613 | 2.2% |
| au | 18586 | 2.2% |
| th | 18570 | 2.2% |
| Other values (223) | 310913 |
Most occurring characters
| Value | Count | Frequency (%) |
| Z | 324591 | |
| N | 152347 | 9.1% |
| L | 133105 | 8.0% |
| I | 124929 | 7.5% |
| D | 113510 | 6.8% |
| M | 73244 | 4.4% |
| G | 71010 | 4.3% |
| C | 67123 | 4.0% |
| P | 64783 | 3.9% |
| R | 63303 | 3.8% |
| Other values (16) | 477789 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1665734 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Z | 324591 | |
| N | 152347 | 9.1% |
| L | 133105 | 8.0% |
| I | 124929 | 7.5% |
| D | 113510 | 6.8% |
| M | 73244 | 4.4% |
| G | 71010 | 4.3% |
| C | 67123 | 4.0% |
| P | 64783 | 3.9% |
| R | 63303 | 3.8% |
| Other values (16) | 477789 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1665734 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Z | 324591 | |
| N | 152347 | 9.1% |
| L | 133105 | 8.0% |
| I | 124929 | 7.5% |
| D | 113510 | 6.8% |
| M | 73244 | 4.4% |
| G | 71010 | 4.3% |
| C | 67123 | 4.0% |
| P | 64783 | 3.9% |
| R | 63303 | 3.8% |
| Other values (16) | 477789 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1665734 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Z | 324591 | |
| N | 152347 | 9.1% |
| L | 133105 | 8.0% |
| I | 124929 | 7.5% |
| D | 113510 | 6.8% |
| M | 73244 | 4.4% |
| G | 71010 | 4.3% |
| C | 67123 | 4.0% |
| P | 64783 | 3.9% |
| R | 63303 | 3.8% |
| Other values (16) | 477789 |
stateProvince
Text
Missing 
| Distinct | 2396 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 512889 |
| Missing (%) | 61.3% |
| Memory size | 6.4 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 27 |
| Mean length | 8.84108623 |
| Min length | 3 |
Unique
| Unique | 478 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Sumatra |
|---|---|
| 2nd row | Borneo |
| 3rd row | Borneo |
| 4th row | Sumatra |
| 5th row | Sumatra |
| Value | Count | Frequency (%) |
| borneo | 39582 | 9.7% |
| new | 35029 | 8.6% |
| guinea | 32739 | 8.0% |
| java | 22932 | 5.6% |
| sumatra | 14157 | 3.5% |
| region | 13293 | 3.2% |
| northern | 9291 | 2.3% |
| zuid-holland | 8882 | 2.2% |
| gelderland | 7192 | 1.8% |
| sulawesi | 6585 | 1.6% |
| Other values (2460) | 219524 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 320355 | 11.2% |
| e | 273195 | 9.6% |
| o | 241553 | 8.5% |
| n | 240670 | 8.4% |
| r | 190437 | 6.7% |
| u | 144597 | 5.1% |
| i | 143886 | 5.0% |
| l | 111277 | 3.9% |
| t | 104424 | 3.7% |
| s | 91117 | 3.2% |
| Other values (91) | 996989 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2274891 | |
| Uppercase Letter | 448270 | 15.7% |
| Space Separator | 85886 | 3.0% |
| Dash Punctuation | 42062 | 1.5% |
| Open Punctuation | 2935 | 0.1% |
| Close Punctuation | 2908 | 0.1% |
| Other Punctuation | 1494 | 0.1% |
| Decimal Number | 37 | < 0.1% |
| Final Punctuation | 16 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 320355 | |
| e | 273195 | |
| o | 241553 | |
| n | 240670 | |
| r | 190437 | |
| u | 144597 | 6.4% |
| i | 143886 | 6.3% |
| l | 111277 | 4.9% |
| t | 104424 | 4.6% |
| s | 91117 | 4.0% |
| Other values (39) | 413380 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 66536 | |
| S | 52217 | |
| B | 50547 | |
| G | 46862 | |
| J | 23670 | 5.3% |
| M | 21688 | 4.8% |
| L | 20607 | 4.6% |
| H | 19364 | 4.3% |
| R | 16592 | 3.7% |
| C | 14802 | 3.3% |
| Other values (21) | 115385 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 9 | |
| 7 | 7 | |
| 6 | 5 | |
| 5 | 5 | |
| 2 | 4 | |
| 3 | 4 | |
| 8 | 2 | 5.4% |
| 1 | 1 | 2.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 867 | |
| ' | 456 | |
| , | 158 | 10.6% |
| ? | 5 | 0.3% |
| & | 5 | 0.3% |
| / | 3 | 0.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 42034 | |
| – | 28 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 85886 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2935 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2908 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 16 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2723161 | |
| Common | 135339 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 320355 | 11.8% |
| e | 273195 | 10.0% |
| o | 241553 | 8.9% |
| n | 240670 | 8.8% |
| r | 190437 | 7.0% |
| u | 144597 | 5.3% |
| i | 143886 | 5.3% |
| l | 111277 | 4.1% |
| t | 104424 | 3.8% |
| s | 91117 | 3.3% |
| Other values (70) | 861650 |
Common
| Value | Count | Frequency (%) |
| 85886 | ||
| - | 42034 | |
| ( | 2935 | 2.2% |
| ) | 2908 | 2.1% |
| . | 867 | 0.6% |
| ' | 456 | 0.3% |
| , | 158 | 0.1% |
| – | 28 | < 0.1% |
| ’ | 16 | < 0.1% |
| 4 | 9 | < 0.1% |
| Other values (11) | 42 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2836154 | |
| None | 22302 | 0.8% |
| Punctuation | 44 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 320355 | 11.3% |
| e | 273195 | 9.6% |
| o | 241553 | 8.5% |
| n | 240670 | 8.5% |
| r | 190437 | 6.7% |
| u | 144597 | 5.1% |
| i | 143886 | 5.1% |
| l | 111277 | 3.9% |
| t | 104424 | 3.7% |
| s | 91117 | 3.2% |
| Other values (61) | 974643 |
None
| Value | Count | Frequency (%) |
| é | 15488 | |
| á | 2406 | 10.8% |
| í | 1047 | 4.7% |
| ô | 639 | 2.9% |
| ó | 629 | 2.8% |
| ü | 572 | 2.6% |
| ä | 289 | 1.3% |
| ã | 266 | 1.2% |
| è | 179 | 0.8% |
| ö | 118 | 0.5% |
| Other values (18) | 669 | 3.0% |
Punctuation
| Value | Count | Frequency (%) |
| – | 28 | |
| ’ | 16 |
locality
Text
Missing 
| Distinct | 529376 |
|---|---|
| Distinct (%) | 74.3% |
| Missing | 123808 |
| Missing (%) | 14.8% |
| Memory size | 6.4 MiB |
Length
| Max length | 1249522 |
|---|---|
| Median length | 342 |
| Mean length | 47.85982052 |
| Min length | 1 |
Unique
| Unique | 472473 ? |
|---|---|
| Unique (%) | 66.3% |
Sample
| 1st row | Nice. |
|---|---|
| 2nd row | E. Coast Sumatra, Siak, Indrapura |
| 3rd row | Corsica; Cargèse. |
| 4th row | Patras, op rots, bij ruine. |
| 5th row | West Borneo, Sintang G. Pahoe |
| Value | Count | Frequency (%) |
| of | 163058 | 3.2% |
| de | 87146 | 1.7% |
| km | 68383 | 1.4% |
| 60966 | 1.2% | |
| in | 53973 | 1.1% |
| the | 38597 | 0.8% |
| near | 36540 | 0.7% |
| road | 35833 | 0.7% |
| bij | 34165 | 0.7% |
| district | 32502 | 0.6% |
| Other values (379124) | 4425190 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4213362 | 12.4% | |
| a | 2982924 | 8.7% |
| e | 2490880 | 7.3% |
| n | 1922345 | 5.6% |
| o | 1814720 | 5.3% |
| i | 1762625 | 5.2% |
| r | 1675236 | 4.9% |
| t | 1309915 | 3.8% |
| . | 1270912 | 3.7% |
| l | 1158859 | 3.4% |
| Other values (193) | 13493606 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22696643 | |
| Space Separator | 4213362 | 12.4% |
| Uppercase Letter | 3524739 | 10.3% |
| Other Punctuation | 2247204 | 6.6% |
| Decimal Number | 709738 | 2.1% |
| Control | 383114 | 1.1% |
| Dash Punctuation | 136444 | 0.4% |
| Open Punctuation | 80934 | 0.2% |
| Close Punctuation | 80592 | 0.2% |
| Math Symbol | 11289 | < 0.1% |
| Other values (9) | 11325 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2982924 | |
| e | 2490880 | |
| n | 1922345 | 8.5% |
| o | 1814720 | 8.0% |
| i | 1762625 | 7.8% |
| r | 1675236 | 7.4% |
| t | 1309915 | 5.8% |
| l | 1158859 | 5.1% |
| s | 1157830 | 5.1% |
| u | 870000 | 3.8% |
| Other values (57) | 5551309 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 333610 | 9.5% |
| P | 267959 | 7.6% |
| M | 248778 | 7.1% |
| B | 240852 | 6.8% |
| N | 221270 | 6.3% |
| C | 207032 | 5.9% |
| A | 190780 | 5.4% |
| T | 169441 | 4.8% |
| R | 167041 | 4.7% |
| L | 152962 | 4.3% |
| Other values (46) | 1325014 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1270912 | |
| , | 717111 | |
| : | 113198 | 5.0% |
| ; | 34269 | 1.5% |
| ' | 29687 | 1.3% |
| / | 23803 | 1.1% |
| ! | 19102 | 0.9% |
| * | 18825 | 0.8% |
| " | 10856 | 0.5% |
| ? | 5322 | 0.2% |
| Other values (10) | 4119 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 130535 | |
| 0 | 99409 | |
| 2 | 99351 | |
| 5 | 70127 | |
| 4 | 67431 | |
| 3 | 63641 | |
| 6 | 53404 | |
| 7 | 47821 | 6.7% |
| 8 | 41309 | 5.8% |
| 9 | 36710 | 5.2% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 3498 | |
| ± | 3405 | |
| = | 2485 | |
| > | 896 | 7.9% |
| < | 621 | 5.5% |
| + | 341 | 3.0% |
| × | 28 | 0.2% |
| ~ | 14 | 0.1% |
| ÷ | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 73371 | |
| [ | 7435 | 9.2% |
| „ | 83 | 0.1% |
| ‚ | 30 | < 0.1% |
| { | 15 | < 0.1% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 907 | |
| ¼ | 183 | 15.2% |
| ¾ | 87 | 7.2% |
| ² | 19 | 1.6% |
| ³ | 9 | 0.7% |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 30 | |
| ’ | 8 | 17.8% |
| › | 4 | 8.9% |
| ” | 3 | 6.7% |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 22 | |
| ‹ | 9 | |
| “ | 3 | 8.6% |
| ‘ | 1 | 2.9% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 15 | |
| ` | 12 | |
| ^ | 4 | 12.1% |
| ¨ | 2 | 6.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 136425 | |
| – | 13 | < 0.1% |
| — | 6 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 73201 | |
| ] | 7384 | 9.2% |
| } | 7 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 918 | |
| ® | 3 | 0.3% |
| ¦ | 1 | 0.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 2 | |
| ¤ | 1 | |
| $ | 1 |
Control
| Value | Count | Frequency (%) |
| 381396 | ||
| 1718 | 0.4% |
Other Letter
| Value | Count | Frequency (%) |
| º | 204 | |
| ª | 5 | 2.4% |
Space Separator
| Value | Count | Frequency (%) |
| 4213362 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8870 |
Modifier Letter
| Value | Count | Frequency (%) |
| ˆ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 26221591 | |
| Common | 7873793 | 23.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2982924 | 11.4% |
| e | 2490880 | 9.5% |
| n | 1922345 | 7.3% |
| o | 1814720 | 6.9% |
| i | 1762625 | 6.7% |
| r | 1675236 | 6.4% |
| t | 1309915 | 5.0% |
| l | 1158859 | 4.4% |
| s | 1157830 | 4.4% |
| u | 870000 | 3.3% |
| Other values (115) | 9076257 |
Common
| Value | Count | Frequency (%) |
| 4213362 | ||
| . | 1270912 | 16.1% |
| , | 717111 | 9.1% |
| 381396 | 4.8% | |
| - | 136425 | 1.7% |
| 1 | 130535 | 1.7% |
| : | 113198 | 1.4% |
| 0 | 99409 | 1.3% |
| 2 | 99351 | 1.3% |
| ( | 73371 | 0.9% |
| Other values (68) | 638723 | 8.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33984659 | |
| None | 110534 | 0.3% |
| Punctuation | 183 | < 0.1% |
| Latin Ext Additional | 6 | < 0.1% |
| Modifier Letters | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4213362 | 12.4% | |
| a | 2982924 | 8.8% |
| e | 2490880 | 7.3% |
| n | 1922345 | 5.7% |
| o | 1814720 | 5.3% |
| i | 1762625 | 5.2% |
| r | 1675236 | 4.9% |
| t | 1309915 | 3.9% |
| . | 1270912 | 3.7% |
| l | 1158859 | 3.4% |
| Other values (87) | 13382881 |
None
| Value | Count | Frequency (%) |
| é | 38543 | |
| á | 8087 | 7.3% |
| è | 7944 | 7.2% |
| ü | 6159 | 5.6% |
| ö | 4779 | 4.3% |
| í | 4351 | 3.9% |
| ë | 4307 | 3.9% |
| ä | 3869 | 3.5% |
| ó | 3731 | 3.4% |
| ê | 3719 | 3.4% |
| Other values (79) | 25045 |
Punctuation
| Value | Count | Frequency (%) |
| „ | 83 | |
| ‚ | 30 | 16.4% |
| – | 13 | 7.1% |
| ‰ | 10 | 5.5% |
| ‹ | 9 | 4.9% |
| ’ | 8 | 4.4% |
| ‡ | 8 | 4.4% |
| — | 6 | 3.3% |
| … | 5 | 2.7% |
| › | 4 | 2.2% |
| Other values (3) | 7 | 3.8% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ạ | 3 | |
| ủ | 2 | |
| ồ | 1 | 16.7% |
Modifier Letters
| Value | Count | Frequency (%) |
| ˆ | 2 |
Missing 
| Distinct | 4412 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 540040 |
| Missing (%) | 64.6% |
| Memory size | 6.4 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 5 |
| Mean length | 6.284624657 |
| Min length | 5 |
Unique
| Unique | 1520 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 10.0 m |
|---|---|
| 2nd row | 600.0 m |
| 3rd row | 250.0 m |
| 4th row | 20.0 m |
| 5th row | 4.0 m |
| Value | Count | Frequency (%) |
| m | 296169 | |
| 0.0 | 166548 | |
| 13757 | 2.2% | |
| 100.0 | 4950 | 0.8% |
| 200.0 | 4534 | 0.7% |
| 50.0 | 4284 | 0.7% |
| 300.0 | 3487 | 0.6% |
| 400.0 | 3461 | 0.6% |
| 500.0 | 3399 | 0.5% |
| 1000.0 | 3080 | 0.5% |
| Other values (2344) | 116183 | 18.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 647657 | |
| 323683 | ||
| . | 309926 | |
| m | 296169 | |
| 1 | 62594 | 3.4% |
| 5 | 51541 | 2.8% |
| 2 | 41404 | 2.2% |
| 3 | 26542 | 1.4% |
| 4 | 21812 | 1.2% |
| 6 | 19042 | 1.0% |
| Other values (4) | 60941 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 917406 | |
| Space Separator | 323683 | 17.4% |
| Other Punctuation | 309926 | 16.7% |
| Lowercase Letter | 296169 | 15.9% |
| Dash Punctuation | 14127 | 0.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 647657 | |
| 1 | 62594 | 6.8% |
| 5 | 51541 | 5.6% |
| 2 | 41404 | 4.5% |
| 3 | 26542 | 2.9% |
| 4 | 21812 | 2.4% |
| 6 | 19042 | 2.1% |
| 7 | 18170 | 2.0% |
| 8 | 15643 | 1.7% |
| 9 | 13001 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 323683 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 309926 |
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 296169 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 14127 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1565142 | |
| Latin | 296169 | 15.9% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 647657 | |
| 323683 | ||
| . | 309926 | |
| 1 | 62594 | 4.0% |
| 5 | 51541 | 3.3% |
| 2 | 41404 | 2.6% |
| 3 | 26542 | 1.7% |
| 4 | 21812 | 1.4% |
| 6 | 19042 | 1.2% |
| 7 | 18170 | 1.2% |
| Other values (3) | 42771 | 2.7% |
Latin
| Value | Count | Frequency (%) |
| m | 296169 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1861311 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 647657 | |
| 323683 | ||
| . | 309926 | |
| m | 296169 | |
| 1 | 62594 | 3.4% |
| 5 | 51541 | 2.8% |
| 2 | 41404 | 2.2% |
| 3 | 26542 | 1.4% |
| 4 | 21812 | 1.2% |
| 6 | 19042 | 1.0% |
| Other values (4) | 60941 | 3.3% |
decimalLatitude
Text
Missing 
| Distinct | 37379 |
|---|---|
| Distinct (%) | 10.6% |
| Missing | 483055 |
| Missing (%) | 57.8% |
| Memory size | 6.4 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 6.959269894 |
| Min length | 3 |
Unique
| Unique | 19749 ? |
|---|---|
| Unique (%) | 5.6% |
Sample
| 1st row | -2.06667 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | -2.18333 |
| 4th row | -2.18333 |
| 5th row | 1.16667 |
| Value | Count | Frequency (%) |
| 52.16011 | 2387 | 0.7% |
| 7.25 | 1447 | 0.4% |
| 5.83333 | 1431 | 0.4% |
| 1.0 | 1312 | 0.4% |
| 3.08333 | 1267 | 0.4% |
| 6.08333 | 1210 | 0.3% |
| 52.14714 | 1142 | 0.3% |
| 51.83515 | 1140 | 0.3% |
| 5.33333 | 1113 | 0.3% |
| 5.38333 | 1100 | 0.3% |
| Other values (34950) | 339605 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 365142 | |
| . | 353154 | |
| 6 | 265229 | |
| 5 | 263570 | |
| 1 | 243547 | |
| 7 | 180077 | |
| 2 | 178855 | |
| 8 | 152951 | |
| 4 | 124787 | 5.1% |
| 0 | 124225 | 5.1% |
| Other values (3) | 206157 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2003589 | |
| Other Punctuation | 353154 | 14.4% |
| Dash Punctuation | 100948 | 4.1% |
| Uppercase Letter | 3 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 365142 | |
| 6 | 265229 | |
| 5 | 263570 | |
| 1 | 243547 | |
| 7 | 180077 | |
| 2 | 178855 | |
| 8 | 152951 | |
| 4 | 124787 | 6.2% |
| 0 | 124225 | 6.2% |
| 9 | 105206 | 5.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 353154 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 100948 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2457691 | |
| Latin | 3 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 365142 | |
| . | 353154 | |
| 6 | 265229 | |
| 5 | 263570 | |
| 1 | 243547 | |
| 7 | 180077 | |
| 2 | 178855 | |
| 8 | 152951 | |
| 4 | 124787 | 5.1% |
| 0 | 124225 | 5.1% |
| Other values (2) | 206154 |
Latin
| Value | Count | Frequency (%) |
| E | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2457694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 365142 | |
| . | 353154 | |
| 6 | 265229 | |
| 5 | 263570 | |
| 1 | 243547 | |
| 7 | 180077 | |
| 2 | 178855 | |
| 8 | 152951 | |
| 4 | 124787 | 5.1% |
| 0 | 124225 | 5.1% |
| Other values (3) | 206157 |
decimalLongitude
Text
Missing 
| Distinct | 43205 |
|---|---|
| Distinct (%) | 12.2% |
| Missing | 483055 |
| Missing (%) | 57.8% |
| Memory size | 6.4 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 7.300126857 |
| Min length | 3 |
Unique
| Unique | 21461 ? |
|---|---|
| Unique (%) | 6.1% |
Sample
| 1st row | 100.93333 |
|---|---|
| 2nd row | 112.0 |
| 3rd row | 99.65 |
| 4th row | 99.65 |
| 5th row | 124.58333 |
| Value | Count | Frequency (%) |
| 4.49701 | 2387 | 0.7% |
| 10.41667 | 1222 | 0.3% |
| 4.05 | 1206 | 0.3% |
| 5.85874 | 1140 | 0.3% |
| 3.01667 | 1134 | 0.3% |
| 4.47406 | 1109 | 0.3% |
| 4.90993 | 911 | 0.3% |
| 4.32798 | 895 | 0.3% |
| 4.47863 | 871 | 0.2% |
| 106.7913 | 866 | 0.2% |
| Other values (41279) | 341413 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 358900 | |
| . | 353154 | |
| 1 | 351204 | |
| 6 | 304827 | |
| 7 | 199473 | |
| 5 | 198837 | |
| 4 | 184388 | |
| 8 | 155506 | |
| 9 | 144673 | |
| 0 | 144241 | |
| Other values (2) | 182866 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2180349 | |
| Other Punctuation | 353154 | 13.7% |
| Dash Punctuation | 44566 | 1.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 358900 | |
| 1 | 351204 | |
| 6 | 304827 | |
| 7 | 199473 | |
| 5 | 198837 | |
| 4 | 184388 | |
| 8 | 155506 | |
| 9 | 144673 | |
| 0 | 144241 | |
| 2 | 138300 | 6.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 353154 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 44566 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2578069 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 358900 | |
| . | 353154 | |
| 1 | 351204 | |
| 6 | 304827 | |
| 7 | 199473 | |
| 5 | 198837 | |
| 4 | 184388 | |
| 8 | 155506 | |
| 9 | 144673 | |
| 0 | 144241 | |
| Other values (2) | 182866 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2578069 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 358900 | |
| . | 353154 | |
| 1 | 351204 | |
| 6 | 304827 | |
| 7 | 199473 | |
| 5 | 198837 | |
| 4 | 184388 | |
| 8 | 155506 | |
| 9 | 144673 | |
| 0 | 144241 | |
| Other values (2) | 182866 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Bakker S |
|---|
| Value | Count | Frequency (%) |
| bakker | 1 | |
| s | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| k | 2 | |
| B | 1 | |
| a | 1 | |
| e | 1 | |
| r | 1 | |
| 1 | ||
| S | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5 | |
| Uppercase Letter | 2 | 25.0% |
| Space Separator | 1 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| k | 2 | |
| a | 1 | |
| e | 1 | |
| r | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 1 | |
| S | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 | |
| Common | 1 | 12.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| k | 2 | |
| B | 1 | |
| a | 1 | |
| e | 1 | |
| r | 1 | |
| S | 1 |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| k | 2 | |
| B | 1 | |
| a | 1 | |
| e | 1 | |
| r | 1 | |
| 1 | ||
| S | 1 |
highestBiostratigraphicZone
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2608920 |
|---|
| Value | Count | Frequency (%) |
| 2608920 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
identificationID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 30 |
| Mean length | 30 |
| Min length | 30 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Physcia caesia (Hoffm.) Fürnr. |
|---|
| Value | Count | Frequency (%) |
| physcia | 1 | |
| caesia | 1 | |
| hoffm | 1 | |
| fürnr | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3 | 10.0% |
| 3 | 10.0% | |
| r | 2 | 6.7% |
| s | 2 | 6.7% |
| c | 2 | 6.7% |
| i | 2 | 6.7% |
| . | 2 | 6.7% |
| f | 2 | 6.7% |
| P | 1 | 3.3% |
| m | 1 | 3.3% |
| Other values (10) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20 | |
| Space Separator | 3 | 10.0% |
| Uppercase Letter | 3 | 10.0% |
| Other Punctuation | 2 | 6.7% |
| Close Punctuation | 1 | 3.3% |
| Open Punctuation | 1 | 3.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| r | 2 | |
| s | 2 | |
| c | 2 | |
| i | 2 | |
| f | 2 | |
| m | 1 | 5.0% |
| ü | 1 | 5.0% |
| o | 1 | 5.0% |
| h | 1 | 5.0% |
| Other values (3) | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| F | 1 | |
| H | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 3 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23 | |
| Common | 7 | 23.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| r | 2 | 8.7% |
| s | 2 | 8.7% |
| c | 2 | 8.7% |
| i | 2 | 8.7% |
| f | 2 | 8.7% |
| P | 1 | 4.3% |
| m | 1 | 4.3% |
| ü | 1 | 4.3% |
| F | 1 | 4.3% |
| Other values (6) | 6 |
Common
| Value | Count | Frequency (%) |
| 3 | ||
| . | 2 | |
| ) | 1 | 14.3% |
| ( | 1 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29 | |
| None | 1 | 3.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3 | 10.3% |
| 3 | 10.3% | |
| r | 2 | 6.9% |
| s | 2 | 6.9% |
| c | 2 | 6.9% |
| i | 2 | 6.9% |
| . | 2 | 6.9% |
| f | 2 | 6.9% |
| P | 1 | 3.4% |
| m | 1 | 3.4% |
| Other values (9) | 9 |
None
| Value | Count | Frequency (%) |
| ü | 1 |
typeStatus
Text
Missing 
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 822537 |
| Missing (%) | 98.4% |
| Memory size | 6.4 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 7.001828555 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | HOLOTYPE |
|---|---|
| 2nd row | ISOTYPE |
| 3rd row | TYPE |
| 4th row | LECTOTYPE |
| 5th row | TYPE |
| Value | Count | Frequency (%) |
| isotype | 6179 | |
| holotype | 2289 | 16.7% |
| type | 2175 | 15.9% |
| syntype | 1373 | 10.0% |
| paratype | 448 | 3.3% |
| isolectotype | 443 | 3.2% |
| lectotype | 436 | 3.2% |
| isosyntype | 179 | 1.3% |
| neotype | 83 | 0.6% |
| isoneotype | 51 | 0.4% |
| Other values (3) | 16 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 15224 | |
| E | 14695 | |
| T | 14562 | |
| P | 14136 | |
| O | 12460 | |
| S | 8404 | |
| I | 6857 | |
| L | 3173 | 3.3% |
| H | 2289 | 2.4% |
| N | 1686 | 1.8% |
| Other values (3) | 2243 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 95729 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 15224 | |
| E | 14695 | |
| T | 14562 | |
| P | 14136 | |
| O | 12460 | |
| S | 8404 | |
| I | 6857 | |
| L | 3173 | 3.3% |
| H | 2289 | 2.4% |
| N | 1686 | 1.8% |
| Other values (3) | 2243 | 2.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 95729 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 15224 | |
| E | 14695 | |
| T | 14562 | |
| P | 14136 | |
| O | 12460 | |
| S | 8404 | |
| I | 6857 | |
| L | 3173 | 3.3% |
| H | 2289 | 2.4% |
| N | 1686 | 1.8% |
| Other values (3) | 2243 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 95729 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 15224 | |
| E | 14695 | |
| T | 14562 | |
| P | 14136 | |
| O | 12460 | |
| S | 8404 | |
| I | 6857 | |
| L | 3173 | 3.3% |
| H | 2289 | 2.4% |
| N | 1686 | 1.8% |
| Other values (3) | 2243 | 2.3% |
identifiedBy
Text
Missing 
| Distinct | 6403 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 693965 |
| Missing (%) | 83.0% |
| Memory size | 6.4 MiB |
Length
| Max length | 66 |
|---|---|
| Median length | 48 |
| Mean length | 11.41105424 |
| Min length | 1 |
Unique
| Unique | 2525 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | Wood GHS |
|---|---|
| 2nd row | Steenis CGGJ van |
| 3rd row | Pereira JT; Wong KM |
| 4th row | Ashton PS |
| 5th row | Nooteboom HP |
| Value | Count | Frequency (%) |
| van | 14889 | 4.5% |
| de | 8088 | 2.4% |
| der | 4298 | 1.3% |
| p | 4266 | 1.3% |
| a | 4145 | 1.2% |
| j | 3993 | 1.2% |
| maas | 3807 | 1.1% |
| pc | 3671 | 1.1% |
| d | 3624 | 1.1% |
| cch | 3447 | 1.0% |
| Other values (5846) | 278934 |
Most occurring characters
| Value | Count | Frequency (%) |
| 190918 | 11.8% | |
| e | 147790 | 9.1% |
| n | 106027 | 6.5% |
| a | 98234 | 6.1% |
| r | 73830 | 4.5% |
| o | 68233 | 4.2% |
| J | 57762 | 3.6% |
| i | 54850 | 3.4% |
| s | 53134 | 3.3% |
| l | 50757 | 3.1% |
| Other values (91) | 721619 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 953756 | |
| Uppercase Letter | 462951 | |
| Space Separator | 190918 | 11.8% |
| Other Punctuation | 9766 | 0.6% |
| Dash Punctuation | 5566 | 0.3% |
| Open Punctuation | 90 | < 0.1% |
| Close Punctuation | 90 | < 0.1% |
| Decimal Number | 16 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 147790 | |
| n | 106027 | |
| a | 98234 | |
| r | 73830 | 7.7% |
| o | 68233 | 7.2% |
| i | 54850 | 5.8% |
| s | 53134 | 5.6% |
| l | 50757 | 5.3% |
| d | 45593 | 4.8% |
| t | 33486 | 3.5% |
| Other values (39) | 221822 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 57762 | |
| C | 39019 | 8.4% |
| M | 37667 | 8.1% |
| H | 34790 | 7.5% |
| A | 32419 | 7.0% |
| P | 28784 | 6.2% |
| S | 27669 | 6.0% |
| B | 26226 | 5.7% |
| W | 25662 | 5.5% |
| L | 17495 | 3.8% |
| Other values (23) | 135458 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 9038 | |
| . | 504 | 5.2% |
| ' | 174 | 1.8% |
| ! | 43 | 0.4% |
| ? | 5 | 0.1% |
| & | 1 | < 0.1% |
| : | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 9 | 3 | |
| 6 | 3 | |
| 4 | 2 | 12.5% |
| 0 | 1 | 6.2% |
| 3 | 1 | 6.2% |
| 5 | 1 | 6.2% |
Space Separator
| Value | Count | Frequency (%) |
| 190918 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5566 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 90 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 90 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1416707 | |
| Common | 206447 | 12.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 147790 | 10.4% |
| n | 106027 | 7.5% |
| a | 98234 | 6.9% |
| r | 73830 | 5.2% |
| o | 68233 | 4.8% |
| J | 57762 | 4.1% |
| i | 54850 | 3.9% |
| s | 53134 | 3.8% |
| l | 50757 | 3.6% |
| d | 45593 | 3.2% |
| Other values (72) | 660497 |
Common
| Value | Count | Frequency (%) |
| 190918 | ||
| ; | 9038 | 4.4% |
| - | 5566 | 2.7% |
| . | 504 | 0.2% |
| ' | 174 | 0.1% |
| ( | 90 | < 0.1% |
| ) | 90 | < 0.1% |
| ! | 43 | < 0.1% |
| ? | 5 | < 0.1% |
| 1 | 5 | < 0.1% |
| Other values (9) | 14 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1620015 | |
| None | 3139 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 190918 | 11.8% | |
| e | 147790 | 9.1% |
| n | 106027 | 6.5% |
| a | 98234 | 6.1% |
| r | 73830 | 4.6% |
| o | 68233 | 4.2% |
| J | 57762 | 3.6% |
| i | 54850 | 3.4% |
| s | 53134 | 3.3% |
| l | 50757 | 3.1% |
| Other values (61) | 718480 |
None
| Value | Count | Frequency (%) |
| é | 968 | |
| á | 650 | |
| í | 358 | 11.4% |
| ö | 307 | 9.8% |
| ü | 216 | 6.9% |
| ñ | 87 | 2.8% |
| è | 84 | 2.7% |
| ä | 71 | 2.3% |
| ó | 61 | 1.9% |
| õ | 49 | 1.6% |
| Other values (20) | 288 | 9.2% |
dateIdentified
Text
Missing 
| Distinct | 8706 |
|---|---|
| Distinct (%) | 12.0% |
| Missing | 763698 |
| Missing (%) | 91.3% |
| Memory size | 6.4 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 3788 ? |
|---|---|
| Unique (%) | 5.2% |
Sample
| 1st row | 1956-11-22T00:00:00 |
|---|---|
| 2nd row | 1995-09-27T00:00:00 |
| 3rd row | 1968-07-01T00:00:00 |
| 4th row | 1972-06-01T00:00:00 |
| 5th row | 1957-01-18T00:00:00 |
| Value | Count | Frequency (%) |
| 1955-03-01t00:00:00 | 346 | 0.5% |
| 1968-07-01t00:00:00 | 330 | 0.5% |
| 1972-06-01t00:00:00 | 328 | 0.5% |
| 1995-10-01t00:00:00 | 275 | 0.4% |
| 2001-12-01t00:00:00 | 275 | 0.4% |
| 1989-08-01t00:00:00 | 248 | 0.3% |
| 2000-01-01t00:00:00 | 236 | 0.3% |
| 2000-06-01t00:00:00 | 233 | 0.3% |
| 1989-04-01t00:00:00 | 222 | 0.3% |
| 2001-03-01t00:00:00 | 221 | 0.3% |
| Other values (8696) | 69797 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 616792 | |
| 1 | 146391 | 10.6% |
| - | 145022 | 10.5% |
| : | 145022 | 10.5% |
| T | 72511 | 5.3% |
| 9 | 65162 | 4.7% |
| 2 | 65099 | 4.7% |
| 8 | 23958 | 1.7% |
| 7 | 22083 | 1.6% |
| 5 | 20282 | 1.5% |
| Other values (3) | 55387 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1015154 | |
| Dash Punctuation | 145022 | 10.5% |
| Other Punctuation | 145022 | 10.5% |
| Uppercase Letter | 72511 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 616792 | |
| 1 | 146391 | 14.4% |
| 9 | 65162 | 6.4% |
| 2 | 65099 | 6.4% |
| 8 | 23958 | 2.4% |
| 7 | 22083 | 2.2% |
| 5 | 20282 | 2.0% |
| 6 | 20115 | 2.0% |
| 3 | 18157 | 1.8% |
| 4 | 17115 | 1.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 145022 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 145022 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 72511 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1305198 | |
| Latin | 72511 | 5.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 616792 | |
| 1 | 146391 | 11.2% |
| - | 145022 | 11.1% |
| : | 145022 | 11.1% |
| 9 | 65162 | 5.0% |
| 2 | 65099 | 5.0% |
| 8 | 23958 | 1.8% |
| 7 | 22083 | 1.7% |
| 5 | 20282 | 1.6% |
| 6 | 20115 | 1.5% |
| Other values (2) | 35272 | 2.7% |
Latin
| Value | Count | Frequency (%) |
| T | 72511 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1377709 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 616792 | |
| 1 | 146391 | 10.6% |
| - | 145022 | 10.5% |
| : | 145022 | 10.5% |
| T | 72511 | 5.3% |
| 9 | 65162 | 4.7% |
| 2 | 65099 | 4.7% |
| 8 | 23958 | 1.7% |
| 7 | 22083 | 1.6% |
| 5 | 20282 | 1.5% |
| Other values (3) | 55387 | 4.0% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 62 |
|---|---|
| Median length | 62 |
| Mean length | 62 |
| Min length | 62 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Fungi|Lichenes-Lecanoromycetes|Caliciales|Lichenes-Physciaceae |
|---|
| Value | Count | Frequency (%) |
| fungi|lichenes-lecanoromycetes|caliciales|lichenes-physciaceae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 10 | |
| c | 7 | |
| i | 6 | |
| s | 5 | 8.1% |
| a | 5 | 8.1% |
| n | 4 | 6.5% |
| | | 3 | 4.8% |
| L | 3 | 4.8% |
| h | 3 | 4.8% |
| y | 2 | 3.2% |
| Other values (11) | 14 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 51 | |
| Uppercase Letter | 6 | 9.7% |
| Math Symbol | 3 | 4.8% |
| Dash Punctuation | 2 | 3.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 10 | |
| c | 7 | |
| i | 6 | |
| s | 5 | |
| a | 5 | |
| n | 4 | 7.8% |
| h | 3 | 5.9% |
| y | 2 | 3.9% |
| l | 2 | 3.9% |
| o | 2 | 3.9% |
| Other values (5) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 3 | |
| C | 1 | 16.7% |
| F | 1 | 16.7% |
| P | 1 | 16.7% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 3 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 57 | |
| Common | 5 | 8.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 10 | |
| c | 7 | |
| i | 6 | |
| s | 5 | |
| a | 5 | |
| n | 4 | 7.0% |
| L | 3 | 5.3% |
| h | 3 | 5.3% |
| y | 2 | 3.5% |
| l | 2 | 3.5% |
| Other values (9) | 10 |
Common
| Value | Count | Frequency (%) |
| | | 3 | |
| - | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 62 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 10 | |
| c | 7 | |
| i | 6 | |
| s | 5 | 8.1% |
| a | 5 | 8.1% |
| n | 4 | 6.5% |
| | | 3 | 4.8% |
| L | 3 | 4.8% |
| h | 3 | 4.8% |
| y | 2 | 3.2% |
| Other values (11) | 14 |
identificationVerificationStatus
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Fungi |
|---|
| Value | Count | Frequency (%) |
| fungi | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 1 | |
| u | 1 | |
| n | 1 | |
| g | 1 | |
| i | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4 | |
| Uppercase Letter | 1 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 1 | |
| n | 1 | |
| g | 1 | |
| i | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 1 | |
| u | 1 | |
| n | 1 | |
| g | 1 | |
| i | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| F | 1 | |
| u | 1 | |
| n | 1 | |
| g | 1 | |
| i | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Ascomycota |
|---|
| Value | Count | Frequency (%) |
| ascomycota | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 2 | |
| o | 2 | |
| A | 1 | |
| s | 1 | |
| m | 1 | |
| y | 1 | |
| t | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9 | |
| Uppercase Letter | 1 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 2 | |
| o | 2 | |
| s | 1 | |
| m | 1 | |
| y | 1 | |
| t | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 2 | |
| o | 2 | |
| A | 1 | |
| s | 1 | |
| m | 1 | |
| y | 1 | |
| t | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 2 | |
| o | 2 | |
| A | 1 | |
| s | 1 | |
| m | 1 | |
| y | 1 | |
| t | 1 | |
| a | 1 |
taxonID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 15 |
| Min length | 15 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Lecanoromycetes |
|---|
| Value | Count | Frequency (%) |
| lecanoromycetes | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3 | |
| c | 2 | |
| o | 2 | |
| L | 1 | 6.7% |
| a | 1 | 6.7% |
| n | 1 | 6.7% |
| r | 1 | 6.7% |
| m | 1 | 6.7% |
| y | 1 | 6.7% |
| t | 1 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14 | |
| Uppercase Letter | 1 | 6.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3 | |
| c | 2 | |
| o | 2 | |
| a | 1 | 7.1% |
| n | 1 | 7.1% |
| r | 1 | 7.1% |
| m | 1 | 7.1% |
| y | 1 | 7.1% |
| t | 1 | 7.1% |
| s | 1 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3 | |
| c | 2 | |
| o | 2 | |
| L | 1 | 6.7% |
| a | 1 | 6.7% |
| n | 1 | 6.7% |
| r | 1 | 6.7% |
| m | 1 | 6.7% |
| y | 1 | 6.7% |
| t | 1 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3 | |
| c | 2 | |
| o | 2 | |
| L | 1 | 6.7% |
| a | 1 | 6.7% |
| n | 1 | 6.7% |
| r | 1 | 6.7% |
| m | 1 | 6.7% |
| y | 1 | 6.7% |
| t | 1 | 6.7% |
scientificNameID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Caliciales |
|---|
| Value | Count | Frequency (%) |
| caliciales | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 2 | |
| i | 2 | |
| C | 1 | |
| c | 1 | |
| e | 1 | |
| s | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9 | |
| Uppercase Letter | 1 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 2 | |
| i | 2 | |
| c | 1 | |
| e | 1 | |
| s | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 2 | |
| i | 2 | |
| C | 1 | |
| c | 1 | |
| e | 1 | |
| s | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 2 | |
| i | 2 | |
| C | 1 | |
| c | 1 | |
| e | 1 | |
| s | 1 |
| Distinct | 126970 |
|---|---|
| Distinct (%) | 15.2% |
| Missing | 48 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.962477322 |
| Min length | 1 |
Unique
| Unique | 51536 ? |
|---|---|
| Unique (%) | 6.2% |
Sample
| 1st row | 3189695 |
|---|---|
| 2nd row | 4097456 |
| 3rd row | 3189695 |
| 4th row | 5284426 |
| 5th row | 3189695 |
| Value | Count | Frequency (%) |
| 6 | 3615 | 0.4% |
| 329 | 1484 | 0.2% |
| 3177662 | 1278 | 0.2% |
| 2919963 | 968 | 0.1% |
| 9458333 | 756 | 0.1% |
| 3189556 | 710 | 0.1% |
| 3061139 | 634 | 0.1% |
| 3029010 | 607 | 0.1% |
| 3033976 | 605 | 0.1% |
| 3065 | 604 | 0.1% |
| Other values (126960) | 824900 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 742621 | |
| 2 | 701754 | |
| 7 | 632660 | |
| 5 | 617498 | |
| 8 | 540994 | |
| 1 | 531253 | |
| 0 | 530978 | |
| 9 | 526129 | |
| 6 | 511504 | |
| 4 | 486361 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5821752 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 742621 | |
| 2 | 701754 | |
| 7 | 632660 | |
| 5 | 617498 | |
| 8 | 540994 | |
| 1 | 531253 | |
| 0 | 530978 | |
| 9 | 526129 | |
| 6 | 511504 | |
| 4 | 486361 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5821752 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 742621 | |
| 2 | 701754 | |
| 7 | 632660 | |
| 5 | 617498 | |
| 8 | 540994 | |
| 1 | 531253 | |
| 0 | 530978 | |
| 9 | 526129 | |
| 6 | 511504 | |
| 4 | 486361 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5821752 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 742621 | |
| 2 | 701754 | |
| 7 | 632660 | |
| 5 | 617498 | |
| 8 | 540994 | |
| 1 | 531253 | |
| 0 | 530978 | |
| 9 | 526129 | |
| 6 | 511504 | |
| 4 | 486361 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Physciaceae |
|---|
| Value | Count | Frequency (%) |
| physciaceae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 2 | |
| a | 2 | |
| e | 2 | |
| P | 1 | |
| h | 1 | |
| y | 1 | |
| s | 1 | |
| i | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10 | |
| Uppercase Letter | 1 | 9.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 2 | |
| a | 2 | |
| e | 2 | |
| h | 1 | |
| y | 1 | |
| s | 1 | |
| i | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 2 | |
| a | 2 | |
| e | 2 | |
| P | 1 | |
| h | 1 | |
| y | 1 | |
| s | 1 | |
| i | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 2 | |
| a | 2 | |
| e | 2 | |
| P | 1 | |
| h | 1 | |
| y | 1 | |
| s | 1 | |
| i | 1 |
taxonConceptID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Physcia |
|---|
| Value | Count | Frequency (%) |
| physcia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 1 | |
| h | 1 | |
| y | 1 | |
| s | 1 | |
| c | 1 | |
| i | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 | |
| Uppercase Letter | 1 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| h | 1 | |
| y | 1 | |
| s | 1 | |
| c | 1 | |
| i | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 1 | |
| h | 1 | |
| y | 1 | |
| s | 1 | |
| c | 1 | |
| i | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 1 | |
| h | 1 | |
| y | 1 | |
| s | 1 | |
| c | 1 | |
| i | 1 | |
| a | 1 |
scientificName
Text
| Distinct | 160037 |
|---|---|
| Distinct (%) | 19.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 122 |
|---|---|
| Median length | 83 |
| Mean length | 28.67196559 |
| Min length | 5 |
Unique
| Unique | 76667 ? |
|---|---|
| Unique (%) | 9.2% |
Sample
| 1st row | Plantago L. |
|---|---|
| 2nd row | Shorea platycarpa F.Heim |
| 3rd row | Plantago L. |
| 4th row | Agathis borneensis Warb. |
| 5th row | Plantago L. |
| Value | Count | Frequency (%) |
| l | 225133 | 7.4% |
| 62415 | 2.1% | |
| ex | 50021 | 1.6% |
| blume | 32940 | 1.1% |
| var | 30936 | 1.0% |
| subsp | 23446 | 0.8% |
| dc | 18796 | 0.6% |
| benth | 14486 | 0.5% |
| miq | 12616 | 0.4% |
| willd | 10991 | 0.4% |
| Other values (67507) | 2550600 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2206652 | 9.2% |
| 2196172 | 9.2% | |
| i | 1707484 | 7.1% |
| e | 1494652 | 6.2% |
| r | 1330069 | 5.5% |
| l | 1202053 | 5.0% |
| s | 1153252 | 4.8% |
| o | 1128720 | 4.7% |
| . | 1104465 | 4.6% |
| u | 1070563 | 4.5% |
| Other values (105) | 9381645 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17702223 | |
| Uppercase Letter | 2299837 | 9.6% |
| Space Separator | 2196172 | 9.2% |
| Other Punctuation | 1183273 | 4.9% |
| Close Punctuation | 264993 | 1.1% |
| Open Punctuation | 264993 | 1.1% |
| Decimal Number | 49356 | 0.2% |
| Dash Punctuation | 10840 | < 0.1% |
| Math Symbol | 4027 | < 0.1% |
| Connector Punctuation | 13 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2206652 | |
| i | 1707484 | 9.6% |
| e | 1494652 | 8.4% |
| r | 1330069 | 7.5% |
| l | 1202053 | 6.8% |
| s | 1153252 | 6.5% |
| o | 1128720 | 6.4% |
| u | 1070563 | 6.0% |
| n | 1065027 | 6.0% |
| t | 874530 | 4.9% |
| Other values (49) | 4469221 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 328795 | |
| C | 203205 | 8.8% |
| S | 191787 | 8.3% |
| B | 176982 | 7.7% |
| P | 147829 | 6.4% |
| M | 144977 | 6.3% |
| A | 143506 | 6.2% |
| H | 124768 | 5.4% |
| D | 116747 | 5.1% |
| R | 109199 | 4.7% |
| Other values (26) | 612042 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 14588 | |
| 8 | 10406 | |
| 9 | 5352 | 10.8% |
| 4 | 3245 | 6.6% |
| 7 | 3237 | 6.6% |
| 3 | 3172 | 6.4% |
| 2 | 3063 | 6.2% |
| 0 | 2636 | 5.3% |
| 5 | 2070 | 4.2% |
| 6 | 1587 | 3.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1104465 | |
| & | 62415 | 5.3% |
| , | 14943 | 1.3% |
| ' | 1450 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2196172 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 264993 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 264993 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10840 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 4027 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 13 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20002060 | |
| Common | 3973667 | 16.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2206652 | 11.0% |
| i | 1707484 | 8.5% |
| e | 1494652 | 7.5% |
| r | 1330069 | 6.6% |
| l | 1202053 | 6.0% |
| s | 1153252 | 5.8% |
| o | 1128720 | 5.6% |
| u | 1070563 | 5.4% |
| n | 1065027 | 5.3% |
| t | 874530 | 4.4% |
| Other values (85) | 6769058 |
Common
| Value | Count | Frequency (%) |
| 2196172 | ||
| . | 1104465 | |
| ) | 264993 | 6.7% |
| ( | 264993 | 6.7% |
| & | 62415 | 1.6% |
| , | 14943 | 0.4% |
| 1 | 14588 | 0.4% |
| - | 10840 | 0.3% |
| 8 | 10406 | 0.3% |
| 9 | 5352 | 0.1% |
| Other values (10) | 24500 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23942518 | |
| None | 33209 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2206652 | 9.2% |
| 2196172 | 9.2% | |
| i | 1707484 | 7.1% |
| e | 1494652 | 6.2% |
| r | 1330069 | 5.6% |
| l | 1202053 | 5.0% |
| s | 1153252 | 4.8% |
| o | 1128720 | 4.7% |
| . | 1104465 | 4.6% |
| u | 1070563 | 4.5% |
| Other values (61) | 9348436 |
None
| Value | Count | Frequency (%) |
| ü | 13768 | |
| é | 7350 | |
| × | 4027 | 12.1% |
| ö | 2099 | 6.3% |
| ä | 1305 | 3.9% |
| á | 858 | 2.6% |
| ó | 794 | 2.4% |
| è | 716 | 2.2% |
| ø | 510 | 1.5% |
| ê | 209 | 0.6% |
| Other values (34) | 1573 | 4.7% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | caesia |
|---|
| Value | Count | Frequency (%) |
| caesia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| c | 1 | |
| e | 1 | |
| s | 1 | |
| i | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| c | 1 | |
| e | 1 | |
| s | 1 | |
| i | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| c | 1 | |
| e | 1 | |
| s | 1 | |
| i | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| c | 1 | |
| e | 1 | |
| s | 1 | |
| i | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | SPECIES |
|---|
| Value | Count | Frequency (%) |
| species | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2 | |
| E | 2 | |
| P | 1 | |
| C | 1 | |
| I | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2 | |
| E | 2 | |
| P | 1 | |
| C | 1 | |
| I | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 2 | |
| E | 2 | |
| P | 1 | |
| C | 1 | |
| I | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 2 | |
| E | 2 | |
| P | 1 | |
| C | 1 | |
| I | 1 |
| Distinct | 1179 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 93 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 79 |
|---|---|
| Median length | 67 |
| Mean length | 29.8587995 |
| Min length | 9 |
Unique
| Unique | 125 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Plantae|Lamiales|Plantaginaceae |
|---|---|
| 2nd row | Plantae|Malvales|Dipterocarpaceae |
| 3rd row | Plantae|Lamiales|Plantaginaceae |
| 4th row | Plantae|Cupressales|Araucariaceae |
| 5th row | Plantae|Lamiales|Plantaginaceae |
| Value | Count | Frequency (%) |
| plantae|fabales|fabaceae | 52176 | 6.2% |
| plantae|asterales|asteraceae | 51643 | 6.2% |
| plantae|poales|poaceae | 43654 | 5.2% |
| plantae|gentianales|rubiaceae | 32700 | 3.9% |
| plantae|poales|cyperaceae | 22272 | 2.7% |
| plantae|lamiales|lamiaceae | 20217 | 2.4% |
| plantae|rosales|rosaceae | 19433 | 2.3% |
| plantae|asparagales|orchidaceae | 16003 | 1.9% |
| plantae|malpighiales|euphorbiaceae | 15183 | 1.8% |
| plantae|malvales|malvaceae | 13567 | 1.6% |
| Other values (1176) | 551185 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5102548 | |
| e | 3832109 | |
| l | 2185070 | |
| | | 1696429 | 6.8% |
| n | 1366109 | 5.5% |
| t | 1256578 | 5.0% |
| s | 1222815 | 4.9% |
| c | 1172986 | 4.7% |
| P | 1048270 | 4.2% |
| i | 944495 | 3.8% |
| Other values (47) | 5138011 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20676808 | |
| Uppercase Letter | 2556492 | 10.2% |
| Math Symbol | 1696429 | 6.8% |
| Dash Punctuation | 28115 | 0.1% |
| Other Punctuation | 5659 | < 0.1% |
| Space Separator | 1917 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5102548 | |
| e | 3832109 | |
| l | 2185070 | |
| n | 1366109 | 6.6% |
| t | 1256578 | 6.1% |
| s | 1222815 | 5.9% |
| c | 1172986 | 5.7% |
| i | 944495 | 4.6% |
| r | 705505 | 3.4% |
| o | 683724 | 3.3% |
| Other values (16) | 2204869 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1048270 | |
| A | 258499 | 10.1% |
| M | 191774 | 7.5% |
| C | 169318 | 6.6% |
| F | 162045 | 6.3% |
| L | 132794 | 5.2% |
| R | 131130 | 5.1% |
| S | 98924 | 3.9% |
| G | 76556 | 3.0% |
| E | 71800 | 2.8% |
| Other values (16) | 215382 | 8.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 3255 | |
| . | 2404 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 1696429 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 28115 |
Space Separator
| Value | Count | Frequency (%) |
| 1917 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23233300 | |
| Common | 1732120 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5102548 | |
| e | 3832109 | |
| l | 2185070 | |
| n | 1366109 | 5.9% |
| t | 1256578 | 5.4% |
| s | 1222815 | 5.3% |
| c | 1172986 | 5.0% |
| P | 1048270 | 4.5% |
| i | 944495 | 4.1% |
| r | 705505 | 3.0% |
| Other values (42) | 4396815 |
Common
| Value | Count | Frequency (%) |
| | | 1696429 | |
| - | 28115 | 1.6% |
| ? | 3255 | 0.2% |
| . | 2404 | 0.1% |
| 1917 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24965420 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5102548 | |
| e | 3832109 | |
| l | 2185070 | |
| | | 1696429 | 6.8% |
| n | 1366109 | 5.5% |
| t | 1256578 | 5.0% |
| s | 1222815 | 4.9% |
| c | 1172986 | 4.7% |
| P | 1048270 | 4.2% |
| i | 944495 | 3.8% |
| Other values (47) | 5138011 |
kingdom
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 7 |
| Mean length | 6.97992124 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Plantae |
|---|---|
| 2nd row | Plantae |
| 3rd row | Plantae |
| 4th row | Plantae |
| 5th row | Plantae |
| Value | Count | Frequency (%) |
| plantae | 810590 | |
| fungi | 16418 | 2.0% |
| chromista | 6571 | 0.8% |
| bacteria | 2508 | 0.3% |
| protozoa | 73 | < 0.1% |
| incertae | 46 | < 0.1% |
| sedis | 46 | < 0.1% |
| animalia | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1632888 | |
| n | 827055 | |
| t | 819788 | |
| e | 813236 | |
| P | 810663 | |
| l | 810591 | |
| i | 25591 | 0.4% |
| F | 16418 | 0.3% |
| u | 16418 | 0.3% |
| g | 16418 | 0.3% |
| Other values (12) | 47593 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5000452 | |
| Uppercase Letter | 836161 | 14.3% |
| Space Separator | 46 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1632888 | |
| n | 827055 | |
| t | 819788 | |
| e | 813236 | |
| l | 810591 | |
| i | 25591 | 0.5% |
| u | 16418 | 0.3% |
| g | 16418 | 0.3% |
| r | 9198 | 0.2% |
| o | 6790 | 0.1% |
| Other values (6) | 22479 | 0.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 810663 | |
| F | 16418 | 2.0% |
| C | 6571 | 0.8% |
| B | 2508 | 0.3% |
| A | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 46 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5836613 | |
| Common | 46 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1632888 | |
| n | 827055 | |
| t | 819788 | |
| e | 813236 | |
| P | 810663 | |
| l | 810591 | |
| i | 25591 | 0.4% |
| F | 16418 | 0.3% |
| u | 16418 | 0.3% |
| g | 16418 | 0.3% |
| Other values (11) | 47547 | 0.8% |
Common
| Value | Count | Frequency (%) |
| 46 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5836659 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1632888 | |
| n | 827055 | |
| t | 819788 | |
| e | 813236 | |
| P | 810663 | |
| l | 810591 | |
| i | 25591 | 0.4% |
| F | 16418 | 0.3% |
| u | 16418 | 0.3% |
| g | 16418 | 0.3% |
| Other values (12) | 47593 | 0.8% |
phylum
Text
| Distinct | 25 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4178 |
| Missing (%) | 0.5% |
| Memory size | 6.4 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 12 |
| Mean length | 11.90524151 |
| Min length | 3 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Tracheophyta |
|---|---|
| 2nd row | Tracheophyta |
| 3rd row | Tracheophyta |
| 4th row | Tracheophyta |
| 5th row | Tracheophyta |
| Value | Count | Frequency (%) |
| tracheophyta | 772865 | |
| rhodophyta | 12168 | 1.5% |
| bryophyta | 10128 | 1.2% |
| basidiomycota | 8611 | 1.0% |
| chlorophyta | 7692 | 0.9% |
| ascomycota | 7630 | 0.9% |
| ochrophyta | 6511 | 0.8% |
| cyanobacteria | 2493 | 0.3% |
| charophyta | 2034 | 0.2% |
| marchantiophyta | 1729 | 0.2% |
| Other values (15) | 170 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1623994 | |
| h | 1616162 | |
| o | 868326 | |
| y | 842138 | |
| t | 833784 | |
| p | 813147 | |
| c | 807626 | |
| r | 803498 | |
| e | 775465 | |
| T | 772865 | |
| Other values (22) | 148525 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9073497 | |
| Uppercase Letter | 832033 | 8.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1623994 | |
| h | 1616162 | |
| o | 868326 | |
| y | 842138 | |
| t | 833784 | |
| p | 813147 | |
| c | 807626 | |
| r | 803498 | |
| e | 775465 | |
| i | 21460 | 0.2% |
| Other values (9) | 67897 | 0.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 772865 | |
| B | 18739 | 2.3% |
| C | 12224 | 1.5% |
| R | 12168 | 1.5% |
| A | 7647 | 0.9% |
| O | 6558 | 0.8% |
| M | 1809 | 0.2% |
| E | 10 | < 0.1% |
| P | 8 | < 0.1% |
| H | 2 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9905530 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1623994 | |
| h | 1616162 | |
| o | 868326 | |
| y | 842138 | |
| t | 833784 | |
| p | 813147 | |
| c | 807626 | |
| r | 803498 | |
| e | 775465 | |
| T | 772865 | |
| Other values (22) | 148525 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9905530 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1623994 | |
| h | 1616162 | |
| o | 868326 | |
| y | 842138 | |
| t | 833784 | |
| p | 813147 | |
| c | 807626 | |
| r | 803498 | |
| e | 775465 | |
| T | 772865 | |
| Other values (22) | 148525 | 1.5% |
class
Text
| Distinct | 76 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4394 |
| Missing (%) | 0.5% |
| Memory size | 6.4 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 13 |
| Mean length | 12.58699591 |
| Min length | 7 |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Magnoliopsida |
|---|---|
| 2nd row | Magnoliopsida |
| 3rd row | Magnoliopsida |
| 4th row | Pinopsida |
| 5th row | Magnoliopsida |
| Value | Count | Frequency (%) |
| magnoliopsida | 602481 | |
| liliopsida | 124141 | 14.9% |
| polypodiopsida | 37585 | 4.5% |
| florideophyceae | 11412 | 1.4% |
| bryopsida | 9385 | 1.1% |
| agaricomycetes | 8120 | 1.0% |
| phaeophyceae | 5400 | 0.6% |
| ulvophyceae | 5312 | 0.6% |
| lecanoromycetes | 4420 | 0.5% |
| lycopodiopsida | 3965 | 0.5% |
| Other values (66) | 19594 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1713580 | |
| o | 1539272 | |
| a | 1448471 | |
| p | 855262 | |
| d | 839665 | |
| s | 801318 | |
| l | 785417 | |
| n | 621120 | 5.9% |
| g | 613876 | 5.9% |
| M | 602686 | 5.8% |
| Other values (33) | 649385 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9638230 | |
| Uppercase Letter | 831822 | 7.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1713580 | |
| o | 1539272 | |
| a | 1448471 | |
| p | 855262 | |
| d | 839665 | |
| s | 801318 | |
| l | 785417 | |
| n | 621120 | 6.4% |
| g | 613876 | 6.4% |
| e | 118229 | 1.2% |
| Other values (13) | 302020 | 3.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 602686 | |
| L | 133228 | 16.0% |
| P | 48442 | 5.8% |
| F | 11412 | 1.4% |
| B | 10605 | 1.3% |
| A | 8316 | 1.0% |
| C | 6387 | 0.8% |
| U | 5327 | 0.6% |
| J | 1598 | 0.2% |
| S | 1174 | 0.1% |
| Other values (10) | 2647 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10470052 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1713580 | |
| o | 1539272 | |
| a | 1448471 | |
| p | 855262 | |
| d | 839665 | |
| s | 801318 | |
| l | 785417 | |
| n | 621120 | 5.9% |
| g | 613876 | 5.9% |
| M | 602686 | 5.8% |
| Other values (33) | 649385 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10470052 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1713580 | |
| o | 1539272 | |
| a | 1448471 | |
| p | 855262 | |
| d | 839665 | |
| s | 801318 | |
| l | 785417 | |
| n | 621120 | 5.9% |
| g | 613876 | 5.9% |
| M | 602686 | 5.8% |
| Other values (33) | 649385 | 6.2% |
order
Text
| Distinct | 379 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6630 |
| Missing (%) | 0.8% |
| Memory size | 6.4 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 17 |
| Mean length | 9.454319601 |
| Min length | 6 |
Unique
| Unique | 39 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Lamiales |
|---|---|
| 2nd row | Malvales |
| 3rd row | Lamiales |
| 4th row | Pinales |
| 5th row | Lamiales |
| Value | Count | Frequency (%) |
| poales | 73718 | 8.9% |
| asterales | 57572 | 6.9% |
| malpighiales | 56399 | 6.8% |
| fabales | 55446 | 6.7% |
| lamiales | 55104 | 6.6% |
| gentianales | 52371 | 6.3% |
| rosales | 40401 | 4.9% |
| ericales | 30937 | 3.7% |
| caryophyllales | 30623 | 3.7% |
| polypodiales | 28749 | 3.5% |
| Other values (369) | 348259 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1361922 | |
| l | 1101125 | |
| s | 1014961 | |
| e | 1006510 | |
| i | 484809 | 6.2% |
| o | 286851 | 3.7% |
| r | 285506 | 3.6% |
| n | 249315 | 3.2% |
| p | 211179 | 2.7% |
| t | 191436 | 2.4% |
| Other values (39) | 1649491 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7013526 | |
| Uppercase Letter | 829579 | 10.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1361922 | |
| l | 1101125 | |
| s | 1014961 | |
| e | 1006510 | |
| i | 484809 | 6.9% |
| o | 286851 | 4.1% |
| r | 285506 | 4.1% |
| n | 249315 | 3.6% |
| p | 211179 | 3.0% |
| t | 191436 | 2.7% |
| Other values (15) | 819912 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 120857 | |
| P | 120381 | |
| A | 115430 | |
| L | 73180 | |
| F | 64704 | |
| G | 61065 | |
| C | 61049 | |
| S | 55307 | |
| R | 55091 | |
| E | 34231 | 4.1% |
| Other values (14) | 68284 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7843105 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1361922 | |
| l | 1101125 | |
| s | 1014961 | |
| e | 1006510 | |
| i | 484809 | 6.2% |
| o | 286851 | 3.7% |
| r | 285506 | 3.6% |
| n | 249315 | 3.2% |
| p | 211179 | 2.7% |
| t | 191436 | 2.4% |
| Other values (39) | 1649491 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7843105 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1361922 | |
| l | 1101125 | |
| s | 1014961 | |
| e | 1006510 | |
| i | 484809 | 6.2% |
| o | 286851 | 3.7% |
| r | 285506 | 3.6% |
| n | 249315 | 3.2% |
| p | 211179 | 2.7% |
| t | 191436 | 2.4% |
| Other values (39) | 1649491 |
family
Text
| Distinct | 1416 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 6695 |
| Missing (%) | 0.8% |
| Memory size | 6.4 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 21 |
| Mean length | 10.77436909 |
| Min length | 7 |
Unique
| Unique | 176 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Plantaginaceae |
|---|---|
| 2nd row | Dipterocarpaceae |
| 3rd row | Plantaginaceae |
| 4th row | Araucariaceae |
| 5th row | Plantaginaceae |
| Value | Count | Frequency (%) |
| fabaceae | 52179 | 6.3% |
| asteraceae | 51696 | 6.2% |
| poaceae | 43659 | 5.3% |
| rubiaceae | 32694 | 3.9% |
| cyperaceae | 22275 | 2.7% |
| lamiaceae | 20240 | 2.4% |
| rosaceae | 19433 | 2.3% |
| orchidaceae | 15993 | 1.9% |
| euphorbiaceae | 15181 | 1.8% |
| malvaceae | 13704 | 1.7% |
| Other values (1406) | 542460 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2080969 | |
| e | 1910674 | |
| c | 993402 | |
| i | 393681 | 4.4% |
| r | 388909 | 4.4% |
| o | 332488 | 3.7% |
| n | 276346 | 3.1% |
| l | 274366 | 3.1% |
| t | 234156 | 2.6% |
| s | 176824 | 2.0% |
| Other values (52) | 1875675 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8107939 | |
| Uppercase Letter | 829518 | 9.3% |
| Decimal Number | 24 | < 0.1% |
| Connector Punctuation | 5 | < 0.1% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2080969 | |
| e | 1910674 | |
| c | 993402 | |
| i | 393681 | 4.9% |
| r | 388909 | 4.8% |
| o | 332488 | 4.1% |
| n | 276346 | 3.4% |
| l | 274366 | 3.4% |
| t | 234156 | 2.9% |
| s | 176824 | 2.2% |
| Other values (16) | 1046124 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 133823 | |
| P | 118188 | |
| C | 94383 | |
| R | 75706 | |
| M | 62478 | |
| F | 58063 | |
| L | 46714 | 5.6% |
| S | 42631 | 5.1% |
| E | 35407 | 4.3% |
| B | 33606 | 4.1% |
| Other values (15) | 128519 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6 | |
| 4 | 5 | |
| 2 | 4 | |
| 5 | 2 | 8.3% |
| 8 | 2 | 8.3% |
| 6 | 2 | 8.3% |
| 9 | 1 | 4.2% |
| 7 | 1 | 4.2% |
| 0 | 1 | 4.2% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8937457 | |
| Common | 33 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2080969 | |
| e | 1910674 | |
| c | 993402 | |
| i | 393681 | 4.4% |
| r | 388909 | 4.4% |
| o | 332488 | 3.7% |
| n | 276346 | 3.1% |
| l | 274366 | 3.1% |
| t | 234156 | 2.6% |
| s | 176824 | 2.0% |
| Other values (41) | 1875642 |
Common
| Value | Count | Frequency (%) |
| 1 | 6 | |
| _ | 5 | |
| 4 | 5 | |
| - | 4 | |
| 2 | 4 | |
| 5 | 2 | 6.1% |
| 8 | 2 | 6.1% |
| 6 | 2 | 6.1% |
| 9 | 1 | 3.0% |
| 7 | 1 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8937490 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2080969 | |
| e | 1910674 | |
| c | 993402 | |
| i | 393681 | 4.4% |
| r | 388909 | 4.4% |
| o | 332488 | 3.7% |
| n | 276346 | 3.1% |
| l | 274366 | 3.1% |
| t | 234156 | 2.6% |
| s | 176824 | 2.0% |
| Other values (52) | 1875675 |
subfamily
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | NL |
|---|
| Value | Count | Frequency (%) |
| nl | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1 | |
| L | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 | |
| L | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1 | |
| L | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1 | |
| L | 1 |
tribe
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2024-11-01T10:28:05.946Z |
|---|
| Value | Count | Frequency (%) |
| 2024-11-01t10:28:05.946z | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1 | 4 | |
| 2 | 3 | |
| 4 | 2 | |
| - | 2 | |
| : | 2 | |
| T | 1 | 4.2% |
| 8 | 1 | 4.2% |
| 5 | 1 | 4.2% |
| . | 1 | 4.2% |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17 | |
| Other Punctuation | 3 | 12.5% |
| Dash Punctuation | 2 | 8.3% |
| Uppercase Letter | 2 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1 | 4 | |
| 2 | 3 | |
| 4 | 2 | |
| 8 | 1 | 5.9% |
| 5 | 1 | 5.9% |
| 9 | 1 | 5.9% |
| 6 | 1 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2 | |
| . | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| Z | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22 | |
| Latin | 2 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1 | 4 | |
| 2 | 3 | |
| 4 | 2 | |
| - | 2 | |
| : | 2 | |
| 8 | 1 | 4.5% |
| 5 | 1 | 4.5% |
| . | 1 | 4.5% |
| 9 | 1 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| T | 1 | |
| Z | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1 | 4 | |
| 2 | 3 | |
| 4 | 2 | |
| - | 2 | |
| : | 2 | |
| T | 1 | 4.2% |
| 8 | 1 | 4.2% |
| 5 | 1 | 4.2% |
| . | 1 | 4.2% |
| Other values (3) | 3 |
genus
Text
Missing 
| Distinct | 13976 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 13165 |
| Missing (%) | 1.6% |
| Memory size | 6.4 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 20 |
| Mean length | 8.628420838 |
| Min length | 2 |
Unique
| Unique | 2459 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Plantago |
|---|---|
| 2nd row | Shorea |
| 3rd row | Plantago |
| 4th row | Agathis |
| 5th row | Plantago |
| Value | Count | Frequency (%) |
| carex | 9711 | 1.2% |
| ficus | 7339 | 0.9% |
| rubus | 6530 | 0.8% |
| taraxacum | 5291 | 0.6% |
| cyperus | 4059 | 0.5% |
| salix | 3696 | 0.4% |
| ranunculus | 3488 | 0.4% |
| galium | 3355 | 0.4% |
| euphorbia | 3348 | 0.4% |
| asplenium | 3302 | 0.4% |
| Other values (13965) | 772925 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 882159 | 12.4% |
| i | 637415 | 9.0% |
| e | 489322 | 6.9% |
| r | 472496 | 6.7% |
| o | 471851 | 6.6% |
| u | 395415 | 5.6% |
| s | 394685 | 5.6% |
| l | 381894 | 5.4% |
| n | 361563 | 5.1% |
| t | 291795 | 4.1% |
| Other values (43) | 2322975 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6278303 | |
| Uppercase Letter | 823109 | 11.6% |
| Dash Punctuation | 158 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 882159 | |
| i | 637415 | |
| e | 489322 | 7.8% |
| r | 472496 | 7.5% |
| o | 471851 | 7.5% |
| u | 395415 | 6.3% |
| s | 394685 | 6.3% |
| l | 381894 | 6.1% |
| n | 361563 | 5.8% |
| t | 291795 | 4.6% |
| Other values (16) | 1499708 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 112385 | |
| P | 87084 | |
| S | 79350 | 9.6% |
| A | 78110 | 9.5% |
| M | 48629 | 5.9% |
| D | 43750 | 5.3% |
| L | 43489 | 5.3% |
| T | 40242 | 4.9% |
| E | 39409 | 4.8% |
| G | 37186 | 4.5% |
| Other values (16) | 213475 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 158 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7101412 | |
| Common | 158 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 882159 | 12.4% |
| i | 637415 | 9.0% |
| e | 489322 | 6.9% |
| r | 472496 | 6.7% |
| o | 471851 | 6.6% |
| u | 395415 | 5.6% |
| s | 394685 | 5.6% |
| l | 381894 | 5.4% |
| n | 361563 | 5.1% |
| t | 291795 | 4.1% |
| Other values (42) | 2322817 |
Common
| Value | Count | Frequency (%) |
| - | 158 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7101570 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 882159 | 12.4% |
| i | 637415 | 9.0% |
| e | 489322 | 6.9% |
| r | 472496 | 6.7% |
| o | 471851 | 6.6% |
| u | 395415 | 5.6% |
| s | 394685 | 5.6% |
| l | 381894 | 5.4% |
| n | 361563 | 5.1% |
| t | 291795 | 4.1% |
| Other values (43) | 2322975 |
genericName
Text
Missing 
| Distinct | 14992 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 13241 |
| Missing (%) | 1.6% |
| Memory size | 6.4 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 19 |
| Mean length | 8.528952523 |
| Min length | 3 |
Unique
| Unique | 3301 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Plantago |
|---|---|
| 2nd row | Shorea |
| 3rd row | Plantago |
| 4th row | Agathis |
| 5th row | Plantago |
| Value | Count | Frequency (%) |
| carex | 9604 | 1.2% |
| ficus | 7336 | 0.9% |
| rubus | 6531 | 0.8% |
| taraxacum | 5292 | 0.6% |
| hieracium | 4623 | 0.6% |
| salix | 3662 | 0.4% |
| ranunculus | 3636 | 0.4% |
| cyperus | 3522 | 0.4% |
| galium | 3425 | 0.4% |
| juncus | 3251 | 0.4% |
| Other values (14981) | 772086 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 868984 | 12.4% |
| i | 638023 | 9.1% |
| e | 479416 | 6.8% |
| r | 468695 | 6.7% |
| o | 461003 | 6.6% |
| u | 398937 | 5.7% |
| s | 388992 | 5.5% |
| l | 364131 | 5.2% |
| n | 358302 | 5.1% |
| t | 287234 | 4.1% |
| Other values (45) | 2305338 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6196012 | |
| Uppercase Letter | 822985 | 11.7% |
| Dash Punctuation | 58 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 868984 | |
| i | 638023 | |
| e | 479416 | 7.7% |
| r | 468695 | 7.6% |
| o | 461003 | 7.4% |
| u | 398937 | 6.4% |
| s | 388992 | 6.3% |
| l | 364131 | 5.9% |
| n | 358302 | 5.8% |
| t | 287234 | 4.6% |
| Other values (18) | 1482295 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 114403 | |
| P | 84684 | 10.3% |
| A | 79730 | 9.7% |
| S | 79497 | 9.7% |
| M | 48353 | 5.9% |
| D | 44247 | 5.4% |
| L | 43302 | 5.3% |
| E | 40630 | 4.9% |
| T | 40116 | 4.9% |
| H | 35828 | 4.4% |
| Other values (16) | 212195 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 58 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7018997 | |
| Common | 58 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 868984 | 12.4% |
| i | 638023 | 9.1% |
| e | 479416 | 6.8% |
| r | 468695 | 6.7% |
| o | 461003 | 6.6% |
| u | 398937 | 5.7% |
| s | 388992 | 5.5% |
| l | 364131 | 5.2% |
| n | 358302 | 5.1% |
| t | 287234 | 4.1% |
| Other values (44) | 2305280 |
Common
| Value | Count | Frequency (%) |
| - | 58 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7019043 | |
| None | 12 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 868984 | 12.4% |
| i | 638023 | 9.1% |
| e | 479416 | 6.8% |
| r | 468695 | 6.7% |
| o | 461003 | 6.6% |
| u | 398937 | 5.7% |
| s | 388992 | 5.5% |
| l | 364131 | 5.2% |
| n | 358302 | 5.1% |
| t | 287234 | 4.1% |
| Other values (43) | 2305326 |
None
| Value | Count | Frequency (%) |
| ë | 11 | |
| ö | 1 | 8.3% |
specificEpithet
Text
Missing 
| Distinct | 40036 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 78237 |
| Missing (%) | 9.4% |
| Memory size | 6.4 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 20 |
| Mean length | 8.998581742 |
| Min length | 2 |
Unique
| Unique | 13804 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | platycarpa |
|---|---|
| 2nd row | borneensis |
| 3rd row | hopeifolia |
| 4th row | hexandrum |
| 5th row | ovata |
| Value | Count | Frequency (%) |
| vulgaris | 4161 | 0.5% |
| palustris | 3085 | 0.4% |
| arvensis | 2985 | 0.4% |
| officinalis | 2666 | 0.4% |
| indica | 2512 | 0.3% |
| repens | 2282 | 0.3% |
| maritima | 2041 | 0.3% |
| alpina | 1923 | 0.3% |
| vulgare | 1822 | 0.2% |
| javanica | 1815 | 0.2% |
| Other values (40026) | 732680 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 934265 | |
| i | 768666 | |
| s | 511342 | 7.5% |
| e | 476245 | 7.0% |
| r | 451502 | 6.6% |
| l | 444825 | 6.5% |
| n | 423483 | 6.2% |
| u | 419441 | 6.1% |
| o | 390489 | 5.7% |
| t | 360947 | 5.3% |
| Other values (20) | 1639468 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6815901 | |
| Dash Punctuation | 4772 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 934265 | |
| i | 768666 | |
| s | 511342 | 7.5% |
| e | 476245 | 7.0% |
| r | 451502 | 6.6% |
| l | 444825 | 6.5% |
| n | 423483 | 6.2% |
| u | 419441 | 6.2% |
| o | 390489 | 5.7% |
| t | 360947 | 5.3% |
| Other values (19) | 1634696 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4772 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6815901 | |
| Common | 4772 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 934265 | |
| i | 768666 | |
| s | 511342 | 7.5% |
| e | 476245 | 7.0% |
| r | 451502 | 6.6% |
| l | 444825 | 6.5% |
| n | 423483 | 6.2% |
| u | 419441 | 6.2% |
| o | 390489 | 5.7% |
| t | 360947 | 5.3% |
| Other values (19) | 1634696 |
Common
| Value | Count | Frequency (%) |
| - | 4772 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6820639 | |
| None | 34 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 934265 | |
| i | 768666 | |
| s | 511342 | 7.5% |
| e | 476245 | 7.0% |
| r | 451502 | 6.6% |
| l | 444825 | 6.5% |
| n | 423483 | 6.2% |
| u | 419441 | 6.1% |
| o | 390489 | 5.7% |
| t | 360947 | 5.3% |
| Other values (17) | 1639434 |
None
| Value | Count | Frequency (%) |
| ï | 30 | |
| ë | 3 | 8.8% |
| ü | 1 | 2.9% |
Missing 
| Distinct | 9364 |
|---|---|
| Distinct (%) | 16.3% |
| Missing | 778925 |
| Missing (%) | 93.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 20 |
| Mean length | 9.13082187 |
| Min length | 3 |
Unique
| Unique | 4061 ? |
|---|---|
| Unique (%) | 7.1% |
Sample
| 1st row | velutinata |
|---|---|
| 2nd row | mollis |
| 3rd row | sycomoroides |
| 4th row | globifera |
| 5th row | formosum |
| Value | Count | Frequency (%) |
| angustifolia | 326 | 0.6% |
| pubescens | 301 | 0.5% |
| album | 284 | 0.5% |
| vulgaris | 276 | 0.5% |
| glabra | 251 | 0.4% |
| major | 240 | 0.4% |
| vulgare | 236 | 0.4% |
| montana | 210 | 0.4% |
| montanum | 192 | 0.3% |
| repens | 189 | 0.3% |
| Other values (9354) | 54779 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 70401 | |
| i | 58390 | |
| s | 39103 | 7.5% |
| e | 37024 | 7.1% |
| l | 35883 | 6.9% |
| r | 34056 | 6.5% |
| u | 33329 | 6.4% |
| n | 31689 | 6.1% |
| o | 30485 | 5.8% |
| t | 28007 | 5.4% |
| Other values (17) | 124683 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 522937 | |
| Dash Punctuation | 113 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 70401 | |
| i | 58390 | |
| s | 39103 | 7.5% |
| e | 37024 | 7.1% |
| l | 35883 | 6.9% |
| r | 34056 | 6.5% |
| u | 33329 | 6.4% |
| n | 31689 | 6.1% |
| o | 30485 | 5.8% |
| t | 28007 | 5.4% |
| Other values (16) | 124570 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 113 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 522937 | |
| Common | 113 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 70401 | |
| i | 58390 | |
| s | 39103 | 7.5% |
| e | 37024 | 7.1% |
| l | 35883 | 6.9% |
| r | 34056 | 6.5% |
| u | 33329 | 6.4% |
| n | 31689 | 6.1% |
| o | 30485 | 5.8% |
| t | 28007 | 5.4% |
| Other values (16) | 124570 |
Common
| Value | Count | Frequency (%) |
| - | 113 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 523050 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 70401 | |
| i | 58390 | |
| s | 39103 | 7.5% |
| e | 37024 | 7.1% |
| l | 35883 | 6.9% |
| r | 34056 | 6.5% |
| u | 33329 | 6.4% |
| n | 31689 | 6.1% |
| o | 30485 | 5.8% |
| t | 28007 | 5.4% |
| Other values (17) | 124683 |
cultivarEpithet
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | true |
|---|
| Value | Count | Frequency (%) |
| true | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 1 | |
| r | 1 | |
| u | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1 | |
| r | 1 | |
| u | 1 | |
| e | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1 | |
| r | 1 | |
| u | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 1 | |
| r | 1 | |
| u | 1 | |
| e | 1 |
taxonRank
Text
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 6.906764824 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | GENUS |
|---|---|
| 2nd row | SPECIES |
| 3rd row | GENUS |
| 4th row | SPECIES |
| 5th row | GENUS |
| Value | Count | Frequency (%) |
| species | 700762 | |
| genus | 64996 | 7.8% |
| variety | 30937 | 3.7% |
| subspecies | 23447 | 2.8% |
| family | 8933 | 1.1% |
| kingdom | 3825 | 0.5% |
| form | 2902 | 0.3% |
| class | 244 | < 0.1% |
| phylum | 138 | < 0.1% |
| order | 23 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1544374 | |
| S | 1537349 | |
| I | 767904 | |
| C | 724453 | |
| P | 724347 | |
| U | 88581 | 1.5% |
| G | 68821 | 1.2% |
| N | 68821 | 1.2% |
| A | 40114 | 0.7% |
| Y | 40008 | 0.7% |
| Other values (16) | 170720 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5775487 | |
| Lowercase Letter | 5 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1544374 | |
| S | 1537349 | |
| I | 767904 | |
| C | 724453 | |
| P | 724347 | |
| U | 88581 | 1.5% |
| G | 68821 | 1.2% |
| N | 68821 | 1.2% |
| A | 40114 | 0.7% |
| Y | 40008 | 0.7% |
| Other values (11) | 170715 | 3.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5775492 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1544374 | |
| S | 1537349 | |
| I | 767904 | |
| C | 724453 | |
| P | 724347 | |
| U | 88581 | 1.5% |
| G | 68821 | 1.2% |
| N | 68821 | 1.2% |
| A | 40114 | 0.7% |
| Y | 40008 | 0.7% |
| Other values (16) | 170720 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5775492 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1544374 | |
| S | 1537349 | |
| I | 767904 | |
| C | 724453 | |
| P | 724347 | |
| U | 88581 | 1.5% |
| G | 68821 | 1.2% |
| N | 68821 | 1.2% |
| A | 40114 | 0.7% |
| Y | 40008 | 0.7% |
| Other values (16) | 170720 | 3.0% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2608920 |
|---|
| Value | Count | Frequency (%) |
| 2608920 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
vernacularName
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2608920 |
|---|
| Value | Count | Frequency (%) |
| 2608920 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.999997608 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ICN |
|---|---|
| 2nd row | ICN |
| 3rd row | ICN |
| 4th row | ICN |
| 5th row | ICN |
| Value | Count | Frequency (%) |
| icn | 836207 | |
| 5 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 836207 | |
| C | 836207 | |
| N | 836207 | |
| 5 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2508621 | |
| Decimal Number | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 836207 | |
| C | 836207 | |
| N | 836207 |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2508621 | |
| Common | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 836207 | |
| C | 836207 | |
| N | 836207 |
Common
| Value | Count | Frequency (%) |
| 5 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2508622 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 836207 | |
| C | 836207 | |
| N | 836207 | |
| 5 | 1 | < 0.1% |
taxonomicStatus
Text
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 47 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.766258213 |
| Min length | 2 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | ACCEPTED |
| 4th row | ACCEPTED |
| 5th row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 630751 | |
| synonym | 195440 | 23.4% |
| doubtful | 9970 | 1.2% |
| 95 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1261502 | |
| C | 1261502 | |
| T | 640721 | |
| D | 640721 | |
| A | 630751 | |
| P | 630751 | |
| Y | 390880 | 6.0% |
| N | 390880 | 6.0% |
| O | 205410 | 3.2% |
| S | 195440 | 3.0% |
| Other values (7) | 245292 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6493848 | |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1261502 | |
| C | 1261502 | |
| T | 640721 | |
| D | 640721 | |
| A | 630751 | |
| P | 630751 | |
| Y | 390880 | 6.0% |
| N | 390880 | 6.0% |
| O | 205410 | 3.2% |
| S | 195440 | 3.0% |
| Other values (5) | 245290 | 3.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 5 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6493848 | |
| Common | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1261502 | |
| C | 1261502 | |
| T | 640721 | |
| D | 640721 | |
| A | 630751 | |
| P | 630751 | |
| Y | 390880 | 6.0% |
| N | 390880 | 6.0% |
| O | 205410 | 3.2% |
| S | 195440 | 3.0% |
| Other values (5) | 245290 | 3.8% |
Common
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 5 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6493850 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1261502 | |
| C | 1261502 | |
| T | 640721 | |
| D | 640721 | |
| A | 630751 | |
| P | 630751 | |
| Y | 390880 | 6.0% |
| N | 390880 | 6.0% |
| O | 205410 | 3.2% |
| S | 195440 | 3.0% |
| Other values (7) | 245292 | 3.8% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 180 |
|---|
| Value | Count | Frequency (%) |
| 180 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 8 | 1 | |
| 0 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 8 | 1 | |
| 0 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 8 | 1 | |
| 0 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 8 | 1 | |
| 0 | 1 |
taxonRemarks
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 10861608 |
|---|
| Value | Count | Frequency (%) |
| 10861608 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0 | 2 | |
| 8 | 2 | |
| 6 | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0 | 2 | |
| 8 | 2 | |
| 6 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0 | 2 | |
| 8 | 2 | |
| 6 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0 | 2 | |
| 8 | 2 | |
| 6 | 2 |
datasetKey
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 35.99996173 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 15f819bd-6612-4447-854b-14d12ee1022d |
|---|---|
| 2nd row | 15f819bd-6612-4447-854b-14d12ee1022d |
| 3rd row | 15f819bd-6612-4447-854b-14d12ee1022d |
| 4th row | 15f819bd-6612-4447-854b-14d12ee1022d |
| 5th row | 15f819bd-6612-4447-854b-14d12ee1022d |
| Value | Count | Frequency (%) |
| 15f819bd-6612-4447-854b-14d12ee1022d | 836207 | |
| 8369 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5017242 | |
| 4 | 4181035 | |
| - | 3344828 | |
| 2 | 3344828 | |
| d | 2508621 | |
| 8 | 1672415 | 5.6% |
| 6 | 1672415 | 5.6% |
| 5 | 1672414 | 5.6% |
| b | 1672414 | 5.6% |
| e | 1672414 | 5.6% |
| Other values (5) | 3344830 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20068972 | |
| Lowercase Letter | 6689656 | 22.2% |
| Dash Punctuation | 3344828 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5017242 | |
| 4 | 4181035 | |
| 2 | 3344828 | |
| 8 | 1672415 | 8.3% |
| 6 | 1672415 | 8.3% |
| 5 | 1672414 | 8.3% |
| 9 | 836208 | 4.2% |
| 7 | 836207 | 4.2% |
| 0 | 836207 | 4.2% |
| 3 | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 2508621 | |
| b | 1672414 | |
| e | 1672414 | |
| f | 836207 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3344828 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23413800 | |
| Latin | 6689656 | 22.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 5017242 | |
| 4 | 4181035 | |
| - | 3344828 | |
| 2 | 3344828 | |
| 8 | 1672415 | 7.1% |
| 6 | 1672415 | 7.1% |
| 5 | 1672414 | 7.1% |
| 9 | 836208 | 3.6% |
| 7 | 836207 | 3.6% |
| 0 | 836207 | 3.6% |
Latin
| Value | Count | Frequency (%) |
| d | 2508621 | |
| b | 1672414 | |
| e | 1672414 | |
| f | 836207 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30103456 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 5017242 | |
| 4 | 4181035 | |
| - | 3344828 | |
| 2 | 3344828 | |
| d | 2508621 | |
| 8 | 1672415 | 5.6% |
| 6 | 1672415 | 5.6% |
| 5 | 1672414 | 5.6% |
| b | 1672414 | 5.6% |
| e | 1672414 | 5.6% |
| Other values (5) | 3344830 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.000005979 |
| Min length | 2 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NL |
|---|---|
| 2nd row | NL |
| 3rd row | NL |
| 4th row | NL |
| 5th row | NL |
| Value | Count | Frequency (%) |
| nl | 836207 | |
| 2600367 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 836207 | |
| L | 836207 | |
| 6 | 2 | < 0.1% |
| 0 | 2 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1672414 | |
| Decimal Number | 7 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 0 | 2 | |
| 2 | 1 | |
| 3 | 1 | |
| 7 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 836207 | |
| L | 836207 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1672414 | |
| Common | 7 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 0 | 2 | |
| 2 | 1 | |
| 3 | 1 | |
| 7 | 1 |
Latin
| Value | Count | Frequency (%) |
| N | 836207 | |
| L | 836207 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1672421 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 836207 | |
| L | 836207 | |
| 6 | 2 | < 0.1% |
| 0 | 2 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
lastInterpreted
Text
| Distinct | 151288 |
|---|---|
| Distinct (%) | 18.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99604404 |
| Min length | 20 |
Unique
| Unique | 19524 ? |
|---|---|
| Unique (%) | 2.3% |
Sample
| 1st row | 2024-11-01T10:27:16.300Z |
|---|---|
| 2nd row | 2024-11-01T10:29:04.857Z |
| 3rd row | 2024-11-01T10:27:16.301Z |
| 4th row | 2024-11-01T10:29:41.603Z |
| 5th row | 2024-11-01T10:27:17.382Z |
| Value | Count | Frequency (%) |
| 2024-11-01t10:27:17.419z | 32 | < 0.1% |
| 2024-11-01t10:26:47.509z | 30 | < 0.1% |
| 2024-11-01t10:27:17.556z | 29 | < 0.1% |
| 2024-11-01t10:27:17.502z | 28 | < 0.1% |
| 2024-11-01t10:28:04.529z | 28 | < 0.1% |
| 2024-11-01t10:27:28.429z | 28 | < 0.1% |
| 2024-11-01t10:27:27.691z | 28 | < 0.1% |
| 2024-11-01t10:27:17.495z | 28 | < 0.1% |
| 2024-11-01t10:27:16.167z | 28 | < 0.1% |
| 2024-11-01t10:27:02.841z | 28 | < 0.1% |
| Other values (151278) | 835920 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3864670 | |
| 0 | 3022309 | |
| 2 | 2963708 | |
| - | 1672414 | |
| : | 1672414 | |
| 4 | 1297614 | 6.5% |
| T | 836207 | 4.2% |
| Z | 836207 | 4.2% |
| . | 835380 | 4.2% |
| 7 | 673118 | 3.4% |
| Other values (5) | 2391619 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14213038 | |
| Other Punctuation | 2507794 | 12.5% |
| Dash Punctuation | 1672414 | 8.3% |
| Uppercase Letter | 1672414 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3864670 | |
| 0 | 3022309 | |
| 2 | 2963708 | |
| 4 | 1297614 | 9.1% |
| 7 | 673118 | 4.7% |
| 8 | 599914 | 4.2% |
| 9 | 470980 | 3.3% |
| 5 | 457024 | 3.2% |
| 3 | 449982 | 3.2% |
| 6 | 413719 | 2.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1672414 | |
| . | 835380 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 836207 | |
| Z | 836207 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1672414 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 18393246 | |
| Latin | 1672414 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 3864670 | |
| 0 | 3022309 | |
| 2 | 2963708 | |
| - | 1672414 | |
| : | 1672414 | |
| 4 | 1297614 | 7.1% |
| . | 835380 | 4.5% |
| 7 | 673118 | 3.7% |
| 8 | 599914 | 3.3% |
| 9 | 470980 | 2.6% |
| Other values (3) | 1320725 | 7.2% |
Latin
| Value | Count | Frequency (%) |
| T | 836207 | |
| Z | 836207 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20065660 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 3864670 | |
| 0 | 3022309 | |
| 2 | 2963708 | |
| - | 1672414 | |
| : | 1672414 | |
| 4 | 1297614 | 6.5% |
| T | 836207 | 4.2% |
| Z | 836207 | 4.2% |
| . | 835380 | 4.2% |
| 7 | 673118 | 3.4% |
| Other values (5) | 2391619 |
elevation
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2608920 |
|---|
| Value | Count | Frequency (%) |
| 2608920 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Physcia caesia |
|---|
| Value | Count | Frequency (%) |
| physcia | 1 | |
| caesia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3 | |
| s | 2 | |
| c | 2 | |
| i | 2 | |
| P | 1 | 7.1% |
| h | 1 | 7.1% |
| y | 1 | 7.1% |
| 1 | 7.1% | |
| e | 1 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12 | |
| Uppercase Letter | 1 | 7.1% |
| Space Separator | 1 | 7.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| s | 2 | |
| c | 2 | |
| i | 2 | |
| h | 1 | 8.3% |
| y | 1 | 8.3% |
| e | 1 | 8.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13 | |
| Common | 1 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| s | 2 | |
| c | 2 | |
| i | 2 | |
| P | 1 | 7.7% |
| h | 1 | 7.7% |
| y | 1 | 7.7% |
| e | 1 | 7.7% |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3 | |
| s | 2 | |
| c | 2 | |
| i | 2 | |
| P | 1 | 7.1% |
| h | 1 | 7.1% |
| y | 1 | 7.1% |
| 1 | 7.1% | |
| e | 1 | 7.1% |
depth
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 30 |
| Mean length | 30 |
| Min length | 30 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Physcia caesia (Hoffm.) Fürnr. |
|---|
| Value | Count | Frequency (%) |
| physcia | 1 | |
| caesia | 1 | |
| hoffm | 1 | |
| fürnr | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3 | 10.0% |
| 3 | 10.0% | |
| r | 2 | 6.7% |
| s | 2 | 6.7% |
| c | 2 | 6.7% |
| i | 2 | 6.7% |
| . | 2 | 6.7% |
| f | 2 | 6.7% |
| P | 1 | 3.3% |
| m | 1 | 3.3% |
| Other values (10) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20 | |
| Space Separator | 3 | 10.0% |
| Uppercase Letter | 3 | 10.0% |
| Other Punctuation | 2 | 6.7% |
| Close Punctuation | 1 | 3.3% |
| Open Punctuation | 1 | 3.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| r | 2 | |
| s | 2 | |
| c | 2 | |
| i | 2 | |
| f | 2 | |
| m | 1 | 5.0% |
| ü | 1 | 5.0% |
| o | 1 | 5.0% |
| h | 1 | 5.0% |
| Other values (3) | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| F | 1 | |
| H | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 3 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23 | |
| Common | 7 | 23.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| r | 2 | 8.7% |
| s | 2 | 8.7% |
| c | 2 | 8.7% |
| i | 2 | 8.7% |
| f | 2 | 8.7% |
| P | 1 | 4.3% |
| m | 1 | 4.3% |
| ü | 1 | 4.3% |
| F | 1 | 4.3% |
| Other values (6) | 6 |
Common
| Value | Count | Frequency (%) |
| 3 | ||
| . | 2 | |
| ) | 1 | 14.3% |
| ( | 1 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29 | |
| None | 1 | 3.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3 | 10.3% |
| 3 | 10.3% | |
| r | 2 | 6.9% |
| s | 2 | 6.9% |
| c | 2 | 6.9% |
| i | 2 | 6.9% |
| . | 2 | 6.9% |
| f | 2 | 6.9% |
| P | 1 | 3.4% |
| m | 1 | 3.4% |
| Other values (9) | 9 |
None
| Value | Count | Frequency (%) |
| ü | 1 |
depthAccuracy
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 39 |
|---|---|
| Median length | 39 |
| Mean length | 39 |
| Min length | 39 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Physcia caesia (Hoffm.) Hampe ex Fürnr. |
|---|
| Value | Count | Frequency (%) |
| physcia | 1 | |
| caesia | 1 | |
| hoffm | 1 | |
| hampe | 1 | |
| ex | 1 | |
| fürnr | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 12.8% | |
| a | 4 | 10.3% |
| e | 3 | 7.7% |
| r | 2 | 5.1% |
| s | 2 | 5.1% |
| c | 2 | 5.1% |
| i | 2 | 5.1% |
| H | 2 | 5.1% |
| f | 2 | 5.1% |
| m | 2 | 5.1% |
| Other values (12) | 13 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 26 | |
| Space Separator | 5 | 12.8% |
| Uppercase Letter | 4 | 10.3% |
| Other Punctuation | 2 | 5.1% |
| Close Punctuation | 1 | 2.6% |
| Open Punctuation | 1 | 2.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| e | 3 | |
| r | 2 | 7.7% |
| s | 2 | 7.7% |
| c | 2 | 7.7% |
| i | 2 | 7.7% |
| f | 2 | 7.7% |
| m | 2 | 7.7% |
| p | 1 | 3.8% |
| ü | 1 | 3.8% |
| Other values (5) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 2 | |
| P | 1 | |
| F | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30 | |
| Common | 9 | 23.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| e | 3 | 10.0% |
| r | 2 | 6.7% |
| s | 2 | 6.7% |
| c | 2 | 6.7% |
| i | 2 | 6.7% |
| H | 2 | 6.7% |
| f | 2 | 6.7% |
| m | 2 | 6.7% |
| P | 1 | 3.3% |
| Other values (8) | 8 |
Common
| Value | Count | Frequency (%) |
| 5 | ||
| . | 2 | 22.2% |
| ) | 1 | 11.1% |
| ( | 1 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38 | |
| None | 1 | 2.6% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | ||
| a | 4 | 10.5% |
| e | 3 | 7.9% |
| r | 2 | 5.3% |
| s | 2 | 5.3% |
| c | 2 | 5.3% |
| i | 2 | 5.3% |
| H | 2 | 5.3% |
| f | 2 | 5.3% |
| m | 2 | 5.3% |
| Other values (11) | 12 |
None
| Value | Count | Frequency (%) |
| ü | 1 |
distanceFromCentroidInMeters
Text
Missing 
| Distinct | 360 |
|---|---|
| Distinct (%) | 11.7% |
| Missing | 833143 |
| Missing (%) | 99.6% |
| Memory size | 6.4 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 18 |
| Mean length | 12.61448141 |
| Min length | 3 |
Unique
| Unique | 155 ? |
|---|---|
| Unique (%) | 5.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 922.1985434932673 |
| 3rd row | 0.0 |
| 4th row | 2546.249171408145 |
| 5th row | 2546.249171408145 |
| Value | Count | Frequency (%) |
| 0.0 | 1013 | |
| 922.1985434932673 | 188 | 6.1% |
| 2546.249171408145 | 115 | 3.8% |
| 3183.772359296243 | 101 | 3.3% |
| 2983.0798593133177 | 95 | 3.1% |
| 4504.128742457356 | 95 | 3.1% |
| 4746.962209460676 | 53 | 1.7% |
| 4281.9160661722035 | 48 | 1.6% |
| 3983.9929662504123 | 41 | 1.3% |
| 2239.9416356999986 | 34 | 1.1% |
| Other values (350) | 1283 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4412 | |
| 4 | 4164 | |
| 2 | 3985 | |
| 3 | 3884 | |
| 9 | 3710 | |
| 1 | 3309 | |
| 5 | 3249 | |
| 6 | 3086 | |
| . | 3066 | |
| 7 | 2990 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 35610 | |
| Other Punctuation | 3066 | 7.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4412 | |
| 4 | 4164 | |
| 2 | 3985 | |
| 3 | 3884 | |
| 9 | 3710 | |
| 1 | 3309 | |
| 5 | 3249 | |
| 6 | 3086 | |
| 7 | 2990 | |
| 8 | 2821 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3066 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 38676 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4412 | |
| 4 | 4164 | |
| 2 | 3985 | |
| 3 | 3884 | |
| 9 | 3710 | |
| 1 | 3309 | |
| 5 | 3249 | |
| 6 | 3086 | |
| . | 3066 | |
| 7 | 2990 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38676 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4412 | |
| 4 | 4164 | |
| 2 | 3985 | |
| 3 | 3884 | |
| 9 | 3710 | |
| 1 | 3309 | |
| 5 | 3249 | |
| 6 | 3086 | |
| . | 3066 | |
| 7 | 2990 |
issue
Text
Missing 
| Distinct | 62 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 776215 |
| Missing (%) | 92.8% |
| Memory size | 6.4 MiB |
Length
| Max length | 107 |
|---|---|
| Median length | 22 |
| Mean length | 24.5116845 |
| Min length | 11 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | TAXON_MATCH_HIGHERRANK |
|---|---|
| 2nd row | TAXON_MATCH_HIGHERRANK |
| 3rd row | TAXON_MATCH_HIGHERRANK |
| 4th row | TAXON_MATCH_HIGHERRANK |
| 5th row | PRESUMED_NEGATED_LATITUDE |
| Value | Count | Frequency (%) |
| taxon_match_higherrank | 30836 | |
| taxon_match_fuzzy | 11173 | 18.6% |
| continent_coordinate_mismatch | 7741 | 12.9% |
| continent_country_mismatch | 2069 | 3.4% |
| country_invalid | 2031 | 3.4% |
| country_coordinate_mismatch | 946 | 1.6% |
| country_derived_from_coordinates;country_invalid | 906 | 1.5% |
| country_coordinate_mismatch;continent_derived_from_coordinates | 858 | 1.4% |
| continent_coordinate_mismatch;continent_country_mismatch | 764 | 1.3% |
| country_coordinate_mismatch;continent_coordinate_mismatch | 540 | 0.9% |
| Other values (52) | 2130 | 3.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 155712 | |
| A | 152174 | |
| N | 145950 | |
| _ | 128614 | |
| H | 121531 | 8.3% |
| O | 99418 | 6.8% |
| C | 97399 | 6.6% |
| R | 93204 | 6.3% |
| I | 85718 | 5.8% |
| M | 76728 | 5.2% |
| Other values (16) | 314106 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1337121 | |
| Connector Punctuation | 128614 | 8.7% |
| Other Punctuation | 4819 | 0.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 155712 | |
| A | 152174 | |
| N | 145950 | |
| H | 121531 | |
| O | 99418 | |
| C | 97399 | |
| R | 93204 | 7.0% |
| I | 85718 | 6.4% |
| M | 76728 | 5.7% |
| E | 67816 | 5.1% |
| Other values (14) | 241471 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 128614 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 4819 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1337121 | |
| Common | 133433 | 9.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 155712 | |
| A | 152174 | |
| N | 145950 | |
| H | 121531 | |
| O | 99418 | |
| C | 97399 | |
| R | 93204 | 7.0% |
| I | 85718 | 6.4% |
| M | 76728 | 5.7% |
| E | 67816 | 5.1% |
| Other values (14) | 241471 |
Common
| Value | Count | Frequency (%) |
| _ | 128614 | |
| ; | 4819 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1470554 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 155712 | |
| A | 152174 | |
| N | 145950 | |
| _ | 128614 | |
| H | 121531 | 8.3% |
| O | 99418 | 6.8% |
| C | 97399 | 6.6% |
| R | 93204 | 6.3% |
| I | 85718 | 5.8% |
| M | 76728 | 5.2% |
| Other values (16) | 314106 |
mediaType
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 57645 |
| Missing (%) | 6.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 131 |
|---|---|
| Median length | 10 |
| Mean length | 10.01355316 |
| Min length | 10 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | StillImage |
|---|---|
| 2nd row | StillImage |
| 3rd row | StillImage |
| 4th row | StillImage |
| 5th row | StillImage |
| Value | Count | Frequency (%) |
| stillimage | 777657 | |
| stillimage;stillimage | 883 | 0.1% |
| stillimage;stillimage;stillimage;stillimage | 8 | < 0.1% |
| stillimage;stillimage;stillimage | 8 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage | 6 | < 0.1% |
| 2024-11-01t10:28:05.946z | 1 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 1559042 | |
| S | 779521 | |
| t | 779521 | |
| i | 779521 | |
| I | 779521 | |
| m | 779521 | |
| a | 779521 | |
| g | 779521 | |
| e | 779521 | |
| ; | 958 | < 0.1% |
| Other values (13) | 24 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6236168 | |
| Uppercase Letter | 1559044 | 20.0% |
| Other Punctuation | 961 | < 0.1% |
| Decimal Number | 17 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4 | |
| 0 | 4 | |
| 2 | 3 | |
| 4 | 2 | |
| 8 | 1 | 5.9% |
| 5 | 1 | 5.9% |
| 9 | 1 | 5.9% |
| 6 | 1 | 5.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1559042 | |
| t | 779521 | |
| i | 779521 | |
| m | 779521 | |
| a | 779521 | |
| g | 779521 | |
| e | 779521 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 779521 | |
| I | 779521 | |
| T | 1 | < 0.1% |
| Z | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 958 | |
| : | 2 | 0.2% |
| . | 1 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7795212 | |
| Common | 980 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| ; | 958 | |
| 1 | 4 | 0.4% |
| 0 | 4 | 0.4% |
| 2 | 3 | 0.3% |
| 4 | 2 | 0.2% |
| - | 2 | 0.2% |
| : | 2 | 0.2% |
| 8 | 1 | 0.1% |
| 5 | 1 | 0.1% |
| . | 1 | 0.1% |
| Other values (2) | 2 | 0.2% |
Latin
| Value | Count | Frequency (%) |
| l | 1559042 | |
| S | 779521 | |
| t | 779521 | |
| i | 779521 | |
| I | 779521 | |
| m | 779521 | |
| a | 779521 | |
| g | 779521 | |
| e | 779521 | |
| T | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7796192 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 1559042 | |
| S | 779521 | |
| t | 779521 | |
| i | 779521 | |
| I | 779521 | |
| m | 779521 | |
| a | 779521 | |
| g | 779521 | |
| e | 779521 | |
| ; | 958 | < 0.1% |
| Other values (13) | 24 | < 0.1% |
hasCoordinate
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 5 |
| Mean length | 4.577694784 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | false |
|---|---|
| 2nd row | true |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 483053 | |
| true | 353154 | |
| 2024-11-01t08:50:07.799z | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 836207 | |
| f | 483053 | |
| l | 483053 | |
| s | 483053 | |
| a | 483053 | |
| t | 353154 | |
| r | 353154 | |
| u | 353154 | |
| 0 | 5 | < 0.1% |
| 1 | 3 | < 0.1% |
| Other values (11) | 16 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3827881 | |
| Decimal Number | 17 | < 0.1% |
| Other Punctuation | 3 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 836207 | |
| f | 483053 | |
| l | 483053 | |
| s | 483053 | |
| a | 483053 | |
| t | 353154 | |
| r | 353154 | |
| u | 353154 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 1 | 3 | |
| 7 | 2 | 11.8% |
| 2 | 2 | 11.8% |
| 9 | 2 | 11.8% |
| 4 | 1 | 5.9% |
| 5 | 1 | 5.9% |
| 8 | 1 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2 | |
| . | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| Z | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3827883 | |
| Common | 22 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 1 | 3 | |
| 7 | 2 | 9.1% |
| 2 | 2 | 9.1% |
| - | 2 | 9.1% |
| 9 | 2 | 9.1% |
| : | 2 | 9.1% |
| . | 1 | 4.5% |
| 4 | 1 | 4.5% |
| 5 | 1 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| e | 836207 | |
| f | 483053 | |
| l | 483053 | |
| s | 483053 | |
| a | 483053 | |
| t | 353154 | |
| r | 353154 | |
| u | 353154 | |
| T | 1 | < 0.1% |
| Z | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3827905 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 836207 | |
| f | 483053 | |
| l | 483053 | |
| s | 483053 | |
| a | 483053 | |
| t | 353154 | |
| r | 353154 | |
| u | 353154 | |
| 0 | 5 | < 0.1% |
| 1 | 3 | < 0.1% |
| Other values (11) | 16 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.996380087 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 833181 | |
| true | 3027 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 836208 | |
| f | 833181 | |
| a | 833181 | |
| l | 833181 | |
| s | 833181 | |
| t | 3027 | 0.1% |
| r | 3027 | 0.1% |
| u | 3027 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4178013 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 836208 | |
| f | 833181 | |
| a | 833181 | |
| l | 833181 | |
| s | 833181 | |
| t | 3027 | 0.1% |
| r | 3027 | 0.1% |
| u | 3027 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4178013 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 836208 | |
| f | 833181 | |
| a | 833181 | |
| l | 833181 | |
| s | 833181 | |
| t | 3027 | 0.1% |
| r | 3027 | 0.1% |
| u | 3027 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4178013 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 836208 | |
| f | 833181 | |
| a | 833181 | |
| l | 833181 | |
| s | 833181 | |
| t | 3027 | 0.1% |
| r | 3027 | 0.1% |
| u | 3027 | 0.1% |
taxonKey
Text
| Distinct | 160036 |
|---|---|
| Distinct (%) | 19.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.959800624 |
| Min length | 1 |
Unique
| Unique | 76666 ? |
|---|---|
| Unique (%) | 9.2% |
Sample
| 1st row | 3189695 |
|---|---|
| 2nd row | 4097456 |
| 3rd row | 3189695 |
| 4th row | 5284426 |
| 5th row | 3189695 |
| Value | Count | Frequency (%) |
| 6 | 3615 | 0.4% |
| 11238428 | 1484 | 0.2% |
| 3177662 | 1278 | 0.2% |
| 2919963 | 909 | 0.1% |
| 3189556 | 699 | 0.1% |
| 3136365 | 627 | 0.1% |
| 3033976 | 605 | 0.1% |
| 3065 | 604 | 0.1% |
| 3029010 | 590 | 0.1% |
| 8798 | 574 | 0.1% |
| Other values (160026) | 825222 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 746905 | |
| 2 | 693044 | |
| 7 | 628059 | |
| 5 | 626227 | |
| 8 | 550520 | |
| 0 | 524218 | |
| 9 | 522549 | |
| 1 | 521280 | |
| 6 | 515660 | |
| 4 | 491372 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5819834 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 746905 | |
| 2 | 693044 | |
| 7 | 628059 | |
| 5 | 626227 | |
| 8 | 550520 | |
| 0 | 524218 | |
| 9 | 522549 | |
| 1 | 521280 | |
| 6 | 515660 | |
| 4 | 491372 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5819834 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 746905 | |
| 2 | 693044 | |
| 7 | 628059 | |
| 5 | 626227 | |
| 8 | 550520 | |
| 0 | 524218 | |
| 9 | 522549 | |
| 1 | 521280 | |
| 6 | 515660 | |
| 4 | 491372 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5819834 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 746905 | |
| 2 | 693044 | |
| 7 | 628059 | |
| 5 | 626227 | |
| 8 | 550520 | |
| 0 | 524218 | |
| 9 | 522549 | |
| 1 | 521280 | |
| 6 | 515660 | |
| 4 | 491372 |
acceptedTaxonKey
Text
| Distinct | 126970 |
|---|---|
| Distinct (%) | 15.2% |
| Missing | 48 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.962477322 |
| Min length | 1 |
Unique
| Unique | 51536 ? |
|---|---|
| Unique (%) | 6.2% |
Sample
| 1st row | 3189695 |
|---|---|
| 2nd row | 4097456 |
| 3rd row | 3189695 |
| 4th row | 5284426 |
| 5th row | 3189695 |
| Value | Count | Frequency (%) |
| 6 | 3615 | 0.4% |
| 329 | 1484 | 0.2% |
| 3177662 | 1278 | 0.2% |
| 2919963 | 968 | 0.1% |
| 9458333 | 756 | 0.1% |
| 3189556 | 710 | 0.1% |
| 3061139 | 634 | 0.1% |
| 3029010 | 607 | 0.1% |
| 3033976 | 605 | 0.1% |
| 3065 | 604 | 0.1% |
| Other values (126960) | 824900 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 742621 | |
| 2 | 701754 | |
| 7 | 632660 | |
| 5 | 617498 | |
| 8 | 540994 | |
| 1 | 531253 | |
| 0 | 530978 | |
| 9 | 526129 | |
| 6 | 511504 | |
| 4 | 486361 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5821752 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 742621 | |
| 2 | 701754 | |
| 7 | 632660 | |
| 5 | 617498 | |
| 8 | 540994 | |
| 1 | 531253 | |
| 0 | 530978 | |
| 9 | 526129 | |
| 6 | 511504 | |
| 4 | 486361 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5821752 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 742621 | |
| 2 | 701754 | |
| 7 | 632660 | |
| 5 | 617498 | |
| 8 | 540994 | |
| 1 | 531253 | |
| 0 | 530978 | |
| 9 | 526129 | |
| 6 | 511504 | |
| 4 | 486361 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5821752 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 742621 | |
| 2 | 701754 | |
| 7 | 632660 | |
| 5 | 617498 | |
| 8 | 540994 | |
| 1 | 531253 | |
| 0 | 530978 | |
| 9 | 526129 | |
| 6 | 511504 | |
| 4 | 486361 |
kingdomKey
Text
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.000004783 |
| Min length | 1 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 6 |
|---|---|
| 2nd row | 6 |
| 3rd row | 6 |
| 4th row | 6 |
| 5th row | 6 |
| Value | Count | Frequency (%) |
| 6 | 810590 | |
| 5 | 16418 | 2.0% |
| 4 | 6571 | 0.8% |
| 3 | 2508 | 0.3% |
| 7 | 73 | < 0.1% |
| 0 | 46 | < 0.1% |
| false | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 810590 | |
| 5 | 16418 | 2.0% |
| 4 | 6571 | 0.8% |
| 3 | 2508 | 0.3% |
| 7 | 73 | < 0.1% |
| 0 | 46 | < 0.1% |
| f | 1 | < 0.1% |
| a | 1 | < 0.1% |
| l | 1 | < 0.1% |
| s | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 836207 | |
| Lowercase Letter | 5 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 810590 | |
| 5 | 16418 | 2.0% |
| 4 | 6571 | 0.8% |
| 3 | 2508 | 0.3% |
| 7 | 73 | < 0.1% |
| 0 | 46 | < 0.1% |
| 1 | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 836207 | |
| Latin | 5 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 810590 | |
| 5 | 16418 | 2.0% |
| 4 | 6571 | 0.8% |
| 3 | 2508 | 0.3% |
| 7 | 73 | < 0.1% |
| 0 | 46 | < 0.1% |
| 1 | 1 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| f | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 836212 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 810590 | |
| 5 | 16418 | 2.0% |
| 4 | 6571 | 0.8% |
| 3 | 2508 | 0.3% |
| 7 | 73 | < 0.1% |
| 0 | 46 | < 0.1% |
| f | 1 | < 0.1% |
| a | 1 | < 0.1% |
| l | 1 | < 0.1% |
| s | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
phylumKey
Text
| Distinct | 25 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4178 |
| Missing (%) | 0.5% |
| Memory size | 6.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.669377703 |
| Min length | 1 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 7707728 |
|---|---|
| 2nd row | 7707728 |
| 3rd row | 7707728 |
| 4th row | 7707728 |
| 5th row | 7707728 |
| Value | Count | Frequency (%) |
| 7707728 | 772865 | |
| 106 | 12168 | 1.5% |
| 35 | 10128 | 1.2% |
| 34 | 8611 | 1.0% |
| 36 | 7692 | 0.9% |
| 95 | 7630 | 0.9% |
| 98 | 6511 | 0.8% |
| 68 | 2493 | 0.3% |
| 7819616 | 2034 | 0.2% |
| 9 | 1729 | 0.2% |
| Other values (15) | 170 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 3093521 | |
| 0 | 785051 | 14.1% |
| 8 | 783924 | 14.1% |
| 2 | 772945 | 13.9% |
| 3 | 26642 | 0.5% |
| 6 | 26422 | 0.5% |
| 9 | 17930 | 0.3% |
| 5 | 17769 | 0.3% |
| 1 | 16296 | 0.3% |
| 4 | 8623 | 0.2% |
| Other values (5) | 6 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5549123 | |
| Uppercase Letter | 6 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 3093521 | |
| 0 | 785051 | 14.1% |
| 8 | 783924 | 14.1% |
| 2 | 772945 | 13.9% |
| 3 | 26642 | 0.5% |
| 6 | 26422 | 0.5% |
| 9 | 17930 | 0.3% |
| 5 | 17769 | 0.3% |
| 1 | 16296 | 0.3% |
| 4 | 8623 | 0.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2 | |
| U | 1 | |
| R | 1 | |
| O | 1 | |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5549123 | |
| Latin | 6 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 3093521 | |
| 0 | 785051 | 14.1% |
| 8 | 783924 | 14.1% |
| 2 | 772945 | 13.9% |
| 3 | 26642 | 0.5% |
| 6 | 26422 | 0.5% |
| 9 | 17930 | 0.3% |
| 5 | 17769 | 0.3% |
| 1 | 16296 | 0.3% |
| 4 | 8623 | 0.2% |
Latin
| Value | Count | Frequency (%) |
| E | 2 | |
| U | 1 | |
| R | 1 | |
| O | 1 | |
| P | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5549129 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 3093521 | |
| 0 | 785051 | 14.1% |
| 8 | 783924 | 14.1% |
| 2 | 772945 | 13.9% |
| 3 | 26642 | 0.5% |
| 6 | 26422 | 0.5% |
| 9 | 17930 | 0.3% |
| 5 | 17769 | 0.3% |
| 1 | 16296 | 0.3% |
| 4 | 8623 | 0.2% |
| Other values (5) | 6 | < 0.1% |
classKey
Text
| Distinct | 76 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4394 |
| Missing (%) | 0.5% |
| Memory size | 6.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.232160997 |
| Min length | 3 |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 220 |
|---|---|
| 2nd row | 220 |
| 3rd row | 220 |
| 4th row | 194 |
| 5th row | 220 |
| Value | Count | Frequency (%) |
| 220 | 602481 | |
| 196 | 124141 | 14.9% |
| 7228684 | 37585 | 4.5% |
| 342 | 11412 | 1.4% |
| 327 | 9385 | 1.1% |
| 186 | 8120 | 1.0% |
| 7073593 | 5400 | 0.6% |
| 195 | 5312 | 0.6% |
| 180 | 4420 | 0.5% |
| 245 | 3965 | 0.5% |
| Other values (66) | 19594 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1314439 | |
| 0 | 617260 | |
| 6 | 175553 | 6.5% |
| 1 | 156948 | 5.8% |
| 9 | 144675 | 5.4% |
| 8 | 92312 | 3.4% |
| 7 | 67617 | 2.5% |
| 4 | 61556 | 2.3% |
| 3 | 42529 | 1.6% |
| 5 | 15665 | 0.6% |
| Other values (5) | 6 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2688554 | |
| Uppercase Letter | 6 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1314439 | |
| 0 | 617260 | |
| 6 | 175553 | 6.5% |
| 1 | 156948 | 5.8% |
| 9 | 144675 | 5.4% |
| 8 | 92312 | 3.4% |
| 7 | 67617 | 2.5% |
| 4 | 61556 | 2.3% |
| 3 | 42529 | 1.6% |
| 5 | 15665 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2 | |
| U | 1 | |
| R | 1 | |
| O | 1 | |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2688554 | |
| Latin | 6 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 1314439 | |
| 0 | 617260 | |
| 6 | 175553 | 6.5% |
| 1 | 156948 | 5.8% |
| 9 | 144675 | 5.4% |
| 8 | 92312 | 3.4% |
| 7 | 67617 | 2.5% |
| 4 | 61556 | 2.3% |
| 3 | 42529 | 1.6% |
| 5 | 15665 | 0.6% |
Latin
| Value | Count | Frequency (%) |
| E | 2 | |
| U | 1 | |
| R | 1 | |
| O | 1 | |
| P | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2688560 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 1314439 | |
| 0 | 617260 | |
| 6 | 175553 | 6.5% |
| 1 | 156948 | 5.8% |
| 9 | 144675 | 5.4% |
| 8 | 92312 | 3.4% |
| 7 | 67617 | 2.5% |
| 4 | 61556 | 2.3% |
| 3 | 42529 | 1.6% |
| 5 | 15665 | 0.6% |
| Other values (5) | 6 | < 0.1% |
orderKey
Text
| Distinct | 379 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6630 |
| Missing (%) | 0.8% |
| Memory size | 6.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.671659962 |
| Min length | 3 |
Unique
| Unique | 39 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 408 |
|---|---|
| 2nd row | 941 |
| 3rd row | 408 |
| 4th row | 640 |
| 5th row | 408 |
| Value | Count | Frequency (%) |
| 1369 | 73718 | 8.9% |
| 414 | 57572 | 6.9% |
| 1414 | 56399 | 6.8% |
| 1370 | 55446 | 6.7% |
| 408 | 55104 | 6.6% |
| 412 | 52371 | 6.3% |
| 691 | 40401 | 4.9% |
| 1353 | 30937 | 3.7% |
| 422 | 30623 | 3.7% |
| 392 | 28749 | 3.5% |
| Other values (369) | 348259 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 658148 | |
| 4 | 499634 | |
| 3 | 360783 | |
| 9 | 328336 | |
| 2 | 299776 | |
| 6 | 252940 | 8.3% |
| 0 | 216746 | 7.1% |
| 7 | 171731 | 5.6% |
| 5 | 160152 | 5.3% |
| 8 | 97686 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3045932 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 658148 | |
| 4 | 499634 | |
| 3 | 360783 | |
| 9 | 328336 | |
| 2 | 299776 | |
| 6 | 252940 | 8.3% |
| 0 | 216746 | 7.1% |
| 7 | 171731 | 5.6% |
| 5 | 160152 | 5.3% |
| 8 | 97686 | 3.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3045932 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 658148 | |
| 4 | 499634 | |
| 3 | 360783 | |
| 9 | 328336 | |
| 2 | 299776 | |
| 6 | 252940 | 8.3% |
| 0 | 216746 | 7.1% |
| 7 | 171731 | 5.6% |
| 5 | 160152 | 5.3% |
| 8 | 97686 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3045932 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 658148 | |
| 4 | 499634 | |
| 3 | 360783 | |
| 9 | 328336 | |
| 2 | 299776 | |
| 6 | 252940 | 8.3% |
| 0 | 216746 | 7.1% |
| 7 | 171731 | 5.6% |
| 5 | 160152 | 5.3% |
| 8 | 97686 | 3.2% |
familyKey
Text
| Distinct | 1416 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 6696 |
| Missing (%) | 0.8% |
| Memory size | 6.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.147109207 |
| Min length | 4 |
Unique
| Unique | 175 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2420 |
|---|---|
| 2nd row | 6645 |
| 3rd row | 2420 |
| 4th row | 3924 |
| 5th row | 2420 |
| Value | Count | Frequency (%) |
| 5386 | 52179 | 6.3% |
| 3065 | 51696 | 6.2% |
| 3073 | 43659 | 5.3% |
| 8798 | 32694 | 3.9% |
| 7708 | 22275 | 2.7% |
| 2497 | 20240 | 2.4% |
| 5015 | 19433 | 2.3% |
| 7689 | 15993 | 1.9% |
| 4691 | 15181 | 1.8% |
| 6685 | 13704 | 1.7% |
| Other values (1406) | 542459 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 584173 | |
| 3 | 419239 | |
| 8 | 373106 | |
| 7 | 361654 | |
| 2 | 327882 | |
| 0 | 316208 | |
| 5 | 303481 | |
| 4 | 271279 | |
| 9 | 248947 | |
| 1 | 234112 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3440081 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 584173 | |
| 3 | 419239 | |
| 8 | 373106 | |
| 7 | 361654 | |
| 2 | 327882 | |
| 0 | 316208 | |
| 5 | 303481 | |
| 4 | 271279 | |
| 9 | 248947 | |
| 1 | 234112 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3440081 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 584173 | |
| 3 | 419239 | |
| 8 | 373106 | |
| 7 | 361654 | |
| 2 | 327882 | |
| 0 | 316208 | |
| 5 | 303481 | |
| 4 | 271279 | |
| 9 | 248947 | |
| 1 | 234112 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3440081 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 584173 | |
| 3 | 419239 | |
| 8 | 373106 | |
| 7 | 361654 | |
| 2 | 327882 | |
| 0 | 316208 | |
| 5 | 303481 | |
| 4 | 271279 | |
| 9 | 248947 | |
| 1 | 234112 |
genusKey
Text
Missing 
| Distinct | 14164 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 13165 |
| Missing (%) | 1.6% |
| Memory size | 6.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.020595497 |
| Min length | 7 |
Unique
| Unique | 2534 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 3189695 |
|---|---|
| 2nd row | 10803341 |
| 3rd row | 3189695 |
| 4th row | 2685008 |
| 5th row | 3189695 |
| Value | Count | Frequency (%) |
| 2721893 | 9711 | 1.2% |
| 2984588 | 7339 | 0.9% |
| 2988638 | 6530 | 0.8% |
| 7787708 | 5116 | 0.6% |
| 2713455 | 4059 | 0.5% |
| 3039576 | 3696 | 0.4% |
| 3033294 | 3488 | 0.4% |
| 2913027 | 3355 | 0.4% |
| 11397237 | 3348 | 0.4% |
| 2650583 | 3302 | 0.4% |
| Other values (14154) | 773100 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 817492 | |
| 3 | 775895 | |
| 7 | 612005 | |
| 9 | 570180 | |
| 8 | 565886 | |
| 1 | 546714 | |
| 0 | 542830 | |
| 6 | 485847 | |
| 5 | 463136 | |
| 4 | 398274 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5778259 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 817492 | |
| 3 | 775895 | |
| 7 | 612005 | |
| 9 | 570180 | |
| 8 | 565886 | |
| 1 | 546714 | |
| 0 | 542830 | |
| 6 | 485847 | |
| 5 | 463136 | |
| 4 | 398274 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5778259 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 817492 | |
| 3 | 775895 | |
| 7 | 612005 | |
| 9 | 570180 | |
| 8 | 565886 | |
| 1 | 546714 | |
| 0 | 542830 | |
| 6 | 485847 | |
| 5 | 463136 | |
| 4 | 398274 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5778259 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 817492 | |
| 3 | 775895 | |
| 7 | 612005 | |
| 9 | 570180 | |
| 8 | 565886 | |
| 1 | 546714 | |
| 0 | 542830 | |
| 6 | 485847 | |
| 5 | 463136 | |
| 4 | 398274 |
speciesKey
Text
Missing 
| Distinct | 111719 |
|---|---|
| Distinct (%) | 14.7% |
| Missing | 78171 |
| Missing (%) | 9.3% |
| Memory size | 6.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.020400033 |
| Min length | 7 |
Unique
| Unique | 44299 ? |
|---|---|
| Unique (%) | 5.8% |
Sample
| 1st row | 4097456 |
|---|---|
| 2nd row | 5284426 |
| 3rd row | 4096206 |
| 4th row | 2886544 |
| 5th row | 3189723 |
| Value | Count | Frequency (%) |
| 9458333 | 757 | 0.1% |
| 7558421 | 511 | 0.1% |
| 9364157 | 471 | 0.1% |
| 2810155 | 420 | 0.1% |
| 2704922 | 413 | 0.1% |
| 8179794 | 383 | 0.1% |
| 2882482 | 380 | 0.1% |
| 2975014 | 376 | < 0.1% |
| 2913130 | 373 | < 0.1% |
| 5350452 | 354 | < 0.1% |
| Other values (111709) | 753600 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 682173 | |
| 2 | 628214 | |
| 5 | 581954 | |
| 7 | 569005 | |
| 8 | 499741 | |
| 1 | 487025 | |
| 0 | 485699 | |
| 9 | 476907 | |
| 4 | 456796 | |
| 6 | 454216 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5321730 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 682173 | |
| 2 | 628214 | |
| 5 | 581954 | |
| 7 | 569005 | |
| 8 | 499741 | |
| 1 | 487025 | |
| 0 | 485699 | |
| 9 | 476907 | |
| 4 | 456796 | |
| 6 | 454216 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5321730 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 682173 | |
| 2 | 628214 | |
| 5 | 581954 | |
| 7 | 569005 | |
| 8 | 499741 | |
| 1 | 487025 | |
| 0 | 485699 | |
| 9 | 476907 | |
| 4 | 456796 | |
| 6 | 454216 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5321730 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 682173 | |
| 2 | 628214 | |
| 5 | 581954 | |
| 7 | 569005 | |
| 8 | 499741 | |
| 1 | 487025 | |
| 0 | 485699 | |
| 9 | 476907 | |
| 4 | 456796 | |
| 6 | 454216 |
species
Text
Missing 
| Distinct | 111366 |
|---|---|
| Distinct (%) | 14.7% |
| Missing | 78171 |
| Missing (%) | 9.3% |
| Memory size | 6.4 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 32 |
| Mean length | 18.62692635 |
| Min length | 8 |
Unique
| Unique | 44055 ? |
|---|---|
| Unique (%) | 5.8% |
Sample
| 1st row | Shorea platycarpa |
|---|---|
| 2nd row | Agathis borneensis |
| 3rd row | Shorea hopeifolia |
| 4th row | Palaquium hexandrum |
| 5th row | Plantago ovata |
| Value | Count | Frequency (%) |
| carex | 9451 | 0.6% |
| ficus | 7087 | 0.5% |
| rubus | 6054 | 0.4% |
| taraxacum | 4923 | 0.3% |
| vulgaris | 4105 | 0.3% |
| cyperus | 3670 | 0.2% |
| ranunculus | 3401 | 0.2% |
| salix | 3385 | 0.2% |
| galium | 3222 | 0.2% |
| euphorbia | 3160 | 0.2% |
| Other values (49526) | 1467762 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1748365 | 12.4% |
| i | 1353758 | 9.6% |
| e | 927246 | 6.6% |
| r | 886879 | 6.3% |
| s | 873334 | 6.2% |
| o | 829205 | 5.9% |
| l | 796707 | 5.6% |
| u | 784536 | 5.6% |
| 758182 | 5.4% | |
| n | 758181 | 5.4% |
| Other values (45) | 4403525 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12598753 | |
| Space Separator | 758182 | 5.4% |
| Uppercase Letter | 758103 | 5.4% |
| Dash Punctuation | 4861 | < 0.1% |
| Math Symbol | 19 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1748365 | |
| i | 1353758 | |
| e | 927246 | 7.4% |
| r | 886879 | 7.0% |
| s | 873334 | 6.9% |
| o | 829205 | 6.6% |
| l | 796707 | 6.3% |
| u | 784536 | 6.2% |
| n | 758181 | 6.0% |
| t | 627646 | 5.0% |
| Other values (16) | 3012896 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 103470 | |
| P | 79067 | 10.4% |
| S | 73083 | 9.6% |
| A | 72797 | 9.6% |
| M | 44569 | 5.9% |
| L | 40379 | 5.3% |
| D | 40075 | 5.3% |
| T | 37129 | 4.9% |
| E | 35245 | 4.6% |
| G | 33655 | 4.4% |
| Other values (16) | 198634 |
Space Separator
| Value | Count | Frequency (%) |
| 758182 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4861 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 19 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13356856 | |
| Common | 763062 | 5.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1748365 | |
| i | 1353758 | 10.1% |
| e | 927246 | 6.9% |
| r | 886879 | 6.6% |
| s | 873334 | 6.5% |
| o | 829205 | 6.2% |
| l | 796707 | 6.0% |
| u | 784536 | 5.9% |
| n | 758181 | 5.7% |
| t | 627646 | 4.7% |
| Other values (42) | 3770999 |
Common
| Value | Count | Frequency (%) |
| 758182 | ||
| - | 4861 | 0.6% |
| × | 19 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14119899 | |
| None | 19 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1748365 | 12.4% |
| i | 1353758 | 9.6% |
| e | 927246 | 6.6% |
| r | 886879 | 6.3% |
| s | 873334 | 6.2% |
| o | 829205 | 5.9% |
| l | 796707 | 5.6% |
| u | 784536 | 5.6% |
| 758182 | 5.4% | |
| n | 758181 | 5.4% |
| Other values (44) | 4403506 |
None
| Value | Count | Frequency (%) |
| × | 19 |
| Distinct | 126970 |
|---|---|
| Distinct (%) | 15.2% |
| Missing | 48 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 152 |
|---|---|
| Median length | 95 |
| Mean length | 29.51563515 |
| Min length | 5 |
Unique
| Unique | 51536 ? |
|---|---|
| Unique (%) | 6.2% |
Sample
| 1st row | Plantago L. |
|---|---|
| 2nd row | Shorea platycarpa F.Heim |
| 3rd row | Plantago L. |
| 4th row | Agathis borneensis Warb. |
| 5th row | Plantago L. |
| Value | Count | Frequency (%) |
| l | 230364 | 7.4% |
| 85958 | 2.8% | |
| ex | 51904 | 1.7% |
| subsp | 38063 | 1.2% |
| blume | 32827 | 1.1% |
| var | 17559 | 0.6% |
| dc | 17352 | 0.6% |
| benth | 14051 | 0.5% |
| miq | 11890 | 0.4% |
| willd | 10347 | 0.3% |
| Other values (63531) | 2597773 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2271927 | 9.2% | |
| a | 2240310 | 9.1% |
| i | 1724481 | 7.0% |
| e | 1527392 | 6.2% |
| r | 1349022 | 5.5% |
| l | 1227513 | 5.0% |
| s | 1201503 | 4.9% |
| o | 1166198 | 4.7% |
| . | 1157405 | 4.7% |
| n | 1097269 | 4.4% |
| Other values (112) | 9716803 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18079514 | |
| Uppercase Letter | 2406058 | 9.7% |
| Space Separator | 2271927 | 9.2% |
| Other Punctuation | 1266100 | 5.1% |
| Close Punctuation | 298092 | 1.2% |
| Open Punctuation | 298092 | 1.2% |
| Decimal Number | 42748 | 0.2% |
| Dash Punctuation | 13201 | 0.1% |
| Math Symbol | 4070 | < 0.1% |
| Connector Punctuation | 21 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2240310 | |
| i | 1724481 | 9.5% |
| e | 1527392 | 8.4% |
| r | 1349022 | 7.5% |
| l | 1227513 | 6.8% |
| s | 1201503 | 6.6% |
| o | 1166198 | 6.5% |
| n | 1097269 | 6.1% |
| u | 1091448 | 6.0% |
| t | 891929 | 4.9% |
| Other values (54) | 4562449 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 340209 | |
| C | 201995 | 8.4% |
| S | 200872 | 8.3% |
| B | 180383 | 7.5% |
| P | 157094 | 6.5% |
| M | 155909 | 6.5% |
| A | 146316 | 6.1% |
| H | 129394 | 5.4% |
| D | 119052 | 4.9% |
| R | 117798 | 4.9% |
| Other values (28) | 657036 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 12439 | |
| 8 | 8588 | |
| 9 | 4589 | 10.7% |
| 2 | 2962 | 6.9% |
| 7 | 2867 | 6.7% |
| 3 | 2823 | 6.6% |
| 0 | 2622 | 6.1% |
| 4 | 2565 | 6.0% |
| 5 | 1962 | 4.6% |
| 6 | 1331 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1157405 | |
| & | 85958 | 6.8% |
| , | 21063 | 1.7% |
| ' | 1674 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2271927 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 298092 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 298092 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13201 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 4070 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 21 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20485572 | |
| Common | 4194251 | 17.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2240310 | 10.9% |
| i | 1724481 | 8.4% |
| e | 1527392 | 7.5% |
| r | 1349022 | 6.6% |
| l | 1227513 | 6.0% |
| s | 1201503 | 5.9% |
| o | 1166198 | 5.7% |
| n | 1097269 | 5.4% |
| u | 1091448 | 5.3% |
| t | 891929 | 4.4% |
| Other values (92) | 6968507 |
Common
| Value | Count | Frequency (%) |
| 2271927 | ||
| . | 1157405 | |
| ) | 298092 | 7.1% |
| ( | 298092 | 7.1% |
| & | 85958 | 2.0% |
| , | 21063 | 0.5% |
| - | 13201 | 0.3% |
| 1 | 12439 | 0.3% |
| 8 | 8588 | 0.2% |
| 9 | 4589 | 0.1% |
| Other values (10) | 22897 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24641457 | |
| None | 38366 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2271927 | 9.2% | |
| a | 2240310 | 9.1% |
| i | 1724481 | 7.0% |
| e | 1527392 | 6.2% |
| r | 1349022 | 5.5% |
| l | 1227513 | 5.0% |
| s | 1201503 | 4.9% |
| o | 1166198 | 4.7% |
| . | 1157405 | 4.7% |
| n | 1097269 | 4.5% |
| Other values (61) | 9678437 |
None
| Value | Count | Frequency (%) |
| ü | 13590 | |
| é | 7617 | |
| × | 4070 | 10.6% |
| ö | 3266 | 8.5% |
| á | 2018 | 5.3% |
| ä | 1595 | 4.2% |
| ó | 1102 | 2.9% |
| è | 637 | 1.7% |
| Á | 609 | 1.6% |
| ø | 593 | 1.5% |
| Other values (41) | 3269 | 8.5% |
| Distinct | 175339 |
|---|---|
| Distinct (%) | 21.0% |
| Missing | 45 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 101 |
|---|---|
| Median length | 84 |
| Mean length | 28.43354175 |
| Min length | 3 |
Unique
| Unique | 90006 ? |
|---|---|
| Unique (%) | 10.8% |
Sample
| 1st row | Plantago psyllium L. |
|---|---|
| 2nd row | Shorea platycarpa Heim |
| 3rd row | Plantago psyllium L. |
| 4th row | Agathis borneensis Warb. |
| 5th row | Plantago psyllium L. |
| Value | Count | Frequency (%) |
| l | 208156 | 7.0% |
| 59763 | 2.0% | |
| ex | 42677 | 1.4% |
| var | 39274 | 1.3% |
| blume | 30277 | 1.0% |
| subsp | 26912 | 0.9% |
| dc | 18463 | 0.6% |
| benth | 14237 | 0.5% |
| indet | 12644 | 0.4% |
| miq | 12572 | 0.4% |
| Other values (72954) | 2530009 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2207594 | 9.3% |
| 2158927 | 9.1% | |
| i | 1707901 | 7.2% |
| e | 1477517 | 6.2% |
| r | 1333444 | 5.6% |
| l | 1185137 | 5.0% |
| s | 1163476 | 4.9% |
| o | 1123360 | 4.7% |
| u | 1075965 | 4.5% |
| . | 1074132 | 4.5% |
| Other values (108) | 9267651 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17714410 | |
| Uppercase Letter | 2225741 | 9.4% |
| Space Separator | 2158927 | 9.1% |
| Other Punctuation | 1149367 | 4.8% |
| Open Punctuation | 256229 | 1.1% |
| Close Punctuation | 256228 | 1.1% |
| Dash Punctuation | 11272 | < 0.1% |
| Math Symbol | 1535 | < 0.1% |
| Decimal Number | 1395 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2207594 | |
| i | 1707901 | 9.6% |
| e | 1477517 | 8.3% |
| r | 1333444 | 7.5% |
| l | 1185137 | 6.7% |
| s | 1163476 | 6.6% |
| o | 1123360 | 6.3% |
| u | 1075965 | 6.1% |
| n | 1072354 | 6.1% |
| t | 884232 | 5.0% |
| Other values (44) | 4483430 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 306900 | |
| C | 198352 | 8.9% |
| S | 188310 | 8.5% |
| B | 169967 | 7.6% |
| M | 141899 | 6.4% |
| P | 141155 | 6.3% |
| A | 140243 | 6.3% |
| H | 122694 | 5.5% |
| D | 113713 | 5.1% |
| R | 103647 | 4.7% |
| Other values (24) | 598861 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1074132 | |
| & | 59726 | 5.2% |
| ' | 12741 | 1.1% |
| , | 2682 | 0.2% |
| " | 49 | < 0.1% |
| ? | 25 | < 0.1% |
| ! | 8 | < 0.1% |
| / | 1 | < 0.1% |
| • | 1 | < 0.1% |
| ; | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 947 | |
| 2 | 274 | 19.6% |
| 4 | 32 | 2.3% |
| 3 | 29 | 2.1% |
| 0 | 29 | 2.1% |
| 6 | 23 | 1.6% |
| 7 | 21 | 1.5% |
| 8 | 15 | 1.1% |
| 5 | 14 | 1.0% |
| 9 | 11 | 0.8% |
Math Symbol
| Value | Count | Frequency (%) |
| × | 1532 | |
| + | 2 | 0.1% |
| = | 1 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 255077 | |
| [ | 1152 | 0.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 255076 | |
| ] | 1152 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 2158927 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11272 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19940151 | |
| Common | 3834953 | 16.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2207594 | 11.1% |
| i | 1707901 | 8.6% |
| e | 1477517 | 7.4% |
| r | 1333444 | 6.7% |
| l | 1185137 | 5.9% |
| s | 1163476 | 5.8% |
| o | 1123360 | 5.6% |
| u | 1075965 | 5.4% |
| n | 1072354 | 5.4% |
| t | 884232 | 4.4% |
| Other values (78) | 6709171 |
Common
| Value | Count | Frequency (%) |
| 2158927 | ||
| . | 1074132 | |
| ( | 255077 | 6.7% |
| ) | 255076 | 6.7% |
| & | 59726 | 1.6% |
| ' | 12741 | 0.3% |
| - | 11272 | 0.3% |
| , | 2682 | 0.1% |
| × | 1532 | < 0.1% |
| ] | 1152 | < 0.1% |
| Other values (20) | 2636 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23744990 | |
| None | 30113 | 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2207594 | 9.3% |
| 2158927 | 9.1% | |
| i | 1707901 | 7.2% |
| e | 1477517 | 6.2% |
| r | 1333444 | 5.6% |
| l | 1185137 | 5.0% |
| s | 1163476 | 4.9% |
| o | 1123360 | 4.7% |
| u | 1075965 | 4.5% |
| . | 1074132 | 4.5% |
| Other values (70) | 9237537 |
None
| Value | Count | Frequency (%) |
| ü | 13903 | |
| é | 7175 | |
| ö | 2297 | 7.6% |
| × | 1532 | 5.1% |
| ä | 1135 | 3.8% |
| ó | 807 | 2.7% |
| á | 802 | 2.7% |
| è | 641 | 2.1% |
| ø | 519 | 1.7% |
| ê | 146 | 0.5% |
| Other values (27) | 1156 | 3.8% |
Punctuation
| Value | Count | Frequency (%) |
| • | 1 |
typifiedName
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 836208 |
| Missing (%) | > 99.9% |
| Memory size | 6.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | NE |
|---|
| Value | Count | Frequency (%) |
| ne | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1 | |
| E | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 | |
| E | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1 | |
| E | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1 | |
| E | 1 |
protocol
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DWC_ARCHIVE |
|---|---|
| 2nd row | DWC_ARCHIVE |
| 3rd row | DWC_ARCHIVE |
| 4th row | DWC_ARCHIVE |
| 5th row | DWC_ARCHIVE |
| Value | Count | Frequency (%) |
| dwc_archive | 836207 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1672414 | |
| D | 836207 | |
| W | 836207 | |
| _ | 836207 | |
| A | 836207 | |
| R | 836207 | |
| H | 836207 | |
| I | 836207 | |
| V | 836207 | |
| E | 836207 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 8362070 | |
| Connector Punctuation | 836207 | 9.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1672414 | |
| D | 836207 | |
| W | 836207 | |
| A | 836207 | |
| R | 836207 | |
| H | 836207 | |
| I | 836207 | |
| V | 836207 | |
| E | 836207 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 836207 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8362070 | |
| Common | 836207 | 9.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 1672414 | |
| D | 836207 | |
| W | 836207 | |
| A | 836207 | |
| R | 836207 | |
| H | 836207 | |
| I | 836207 | |
| V | 836207 | |
| E | 836207 |
Common
| Value | Count | Frequency (%) |
| _ | 836207 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9198277 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 1672414 | |
| D | 836207 | |
| W | 836207 | |
| _ | 836207 | |
| A | 836207 | |
| R | 836207 | |
| H | 836207 | |
| I | 836207 | |
| V | 836207 | |
| E | 836207 |
lastParsed
Text
| Distinct | 151288 |
|---|---|
| Distinct (%) | 18.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99604404 |
| Min length | 20 |
Unique
| Unique | 19524 ? |
|---|---|
| Unique (%) | 2.3% |
Sample
| 1st row | 2024-11-01T10:27:16.300Z |
|---|---|
| 2nd row | 2024-11-01T10:29:04.857Z |
| 3rd row | 2024-11-01T10:27:16.301Z |
| 4th row | 2024-11-01T10:29:41.603Z |
| 5th row | 2024-11-01T10:27:17.382Z |
| Value | Count | Frequency (%) |
| 2024-11-01t10:27:17.419z | 32 | < 0.1% |
| 2024-11-01t10:26:47.509z | 30 | < 0.1% |
| 2024-11-01t10:27:17.556z | 29 | < 0.1% |
| 2024-11-01t10:27:17.502z | 28 | < 0.1% |
| 2024-11-01t10:28:04.529z | 28 | < 0.1% |
| 2024-11-01t10:27:28.429z | 28 | < 0.1% |
| 2024-11-01t10:27:27.691z | 28 | < 0.1% |
| 2024-11-01t10:27:17.495z | 28 | < 0.1% |
| 2024-11-01t10:27:16.167z | 28 | < 0.1% |
| 2024-11-01t10:27:02.841z | 28 | < 0.1% |
| Other values (151278) | 835920 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3864670 | |
| 0 | 3022309 | |
| 2 | 2963708 | |
| - | 1672414 | |
| : | 1672414 | |
| 4 | 1297614 | 6.5% |
| T | 836207 | 4.2% |
| Z | 836207 | 4.2% |
| . | 835380 | 4.2% |
| 7 | 673118 | 3.4% |
| Other values (5) | 2391619 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14213038 | |
| Other Punctuation | 2507794 | 12.5% |
| Dash Punctuation | 1672414 | 8.3% |
| Uppercase Letter | 1672414 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3864670 | |
| 0 | 3022309 | |
| 2 | 2963708 | |
| 4 | 1297614 | 9.1% |
| 7 | 673118 | 4.7% |
| 8 | 599914 | 4.2% |
| 9 | 470980 | 3.3% |
| 5 | 457024 | 3.2% |
| 3 | 449982 | 3.2% |
| 6 | 413719 | 2.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1672414 | |
| . | 835380 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 836207 | |
| Z | 836207 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1672414 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 18393246 | |
| Latin | 1672414 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 3864670 | |
| 0 | 3022309 | |
| 2 | 2963708 | |
| - | 1672414 | |
| : | 1672414 | |
| 4 | 1297614 | 7.1% |
| . | 835380 | 4.5% |
| 7 | 673118 | 3.7% |
| 8 | 599914 | 3.3% |
| 9 | 470980 | 2.6% |
| Other values (3) | 1320725 | 7.2% |
Latin
| Value | Count | Frequency (%) |
| T | 836207 | |
| Z | 836207 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20065660 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 3864670 | |
| 0 | 3022309 | |
| 2 | 2963708 | |
| - | 1672414 | |
| : | 1672414 | |
| 4 | 1297614 | 6.5% |
| T | 836207 | 4.2% |
| Z | 836207 | 4.2% |
| . | 835380 | 4.2% |
| 7 | 673118 | 3.4% |
| Other values (5) | 2391619 |
lastCrawled
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2024-11-01T08:50:07.799Z |
|---|---|
| 2nd row | 2024-11-01T08:50:07.799Z |
| 3rd row | 2024-11-01T08:50:07.799Z |
| 4th row | 2024-11-01T08:50:07.799Z |
| 5th row | 2024-11-01T08:50:07.799Z |
| Value | Count | Frequency (%) |
| 2024-11-01t08:50:07.799z | 836207 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4181035 | |
| 1 | 2508621 | |
| 2 | 1672414 | 8.3% |
| - | 1672414 | 8.3% |
| : | 1672414 | 8.3% |
| 7 | 1672414 | 8.3% |
| 9 | 1672414 | 8.3% |
| 4 | 836207 | 4.2% |
| T | 836207 | 4.2% |
| 8 | 836207 | 4.2% |
| Other values (3) | 2508621 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14215519 | |
| Other Punctuation | 2508621 | 12.5% |
| Dash Punctuation | 1672414 | 8.3% |
| Uppercase Letter | 1672414 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4181035 | |
| 1 | 2508621 | |
| 2 | 1672414 | 11.8% |
| 7 | 1672414 | 11.8% |
| 9 | 1672414 | 11.8% |
| 4 | 836207 | 5.9% |
| 8 | 836207 | 5.9% |
| 5 | 836207 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1672414 | |
| . | 836207 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 836207 | |
| Z | 836207 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1672414 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 18396554 | |
| Latin | 1672414 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4181035 | |
| 1 | 2508621 | |
| 2 | 1672414 | 9.1% |
| - | 1672414 | 9.1% |
| : | 1672414 | 9.1% |
| 7 | 1672414 | 9.1% |
| 9 | 1672414 | 9.1% |
| 4 | 836207 | 4.5% |
| 8 | 836207 | 4.5% |
| 5 | 836207 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| T | 836207 | |
| Z | 836207 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20068968 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4181035 | |
| 1 | 2508621 | |
| 2 | 1672414 | 8.3% |
| - | 1672414 | 8.3% |
| : | 1672414 | 8.3% |
| 7 | 1672414 | 8.3% |
| 9 | 1672414 | 8.3% |
| 4 | 836207 | 4.2% |
| T | 836207 | 4.2% |
| 8 | 836207 | 4.2% |
| Other values (3) | 2508621 |
repatriated
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2188 |
| Missing (%) | 0.3% |
| Memory size | 6.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.142760194 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | true |
|---|---|
| 2nd row | true |
| 3rd row | true |
| 4th row | true |
| 5th row | true |
| Value | Count | Frequency (%) |
| true | 714956 | |
| false | 119065 | 14.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 834021 | |
| t | 714956 | |
| r | 714956 | |
| u | 714956 | |
| f | 119065 | 3.4% |
| a | 119065 | 3.4% |
| l | 119065 | 3.4% |
| s | 119065 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3455149 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 834021 | |
| t | 714956 | |
| r | 714956 | |
| u | 714956 | |
| f | 119065 | 3.4% |
| a | 119065 | 3.4% |
| l | 119065 | 3.4% |
| s | 119065 | 3.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3455149 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 834021 | |
| t | 714956 | |
| r | 714956 | |
| u | 714956 | |
| f | 119065 | 3.4% |
| a | 119065 | 3.4% |
| l | 119065 | 3.4% |
| s | 119065 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3455149 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 834021 | |
| t | 714956 | |
| r | 714956 | |
| u | 714956 | |
| f | 119065 | 3.4% |
| a | 119065 | 3.4% |
| l | 119065 | 3.4% |
| s | 119065 | 3.4% |
isSequenced
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 836207 |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 836207 | |
| a | 836207 | |
| l | 836207 | |
| s | 836207 | |
| e | 836207 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4181035 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 836207 | |
| a | 836207 | |
| l | 836207 | |
| s | 836207 | |
| e | 836207 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4181035 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 836207 | |
| a | 836207 | |
| l | 836207 | |
| s | 836207 | |
| e | 836207 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4181035 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 836207 | |
| a | 836207 | |
| l | 836207 | |
| s | 836207 | |
| e | 836207 |
gbifRegion
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 151640 |
| Missing (%) | 18.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 6.565221329 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EUROPE |
|---|---|
| 2nd row | ASIA |
| 3rd row | EUROPE |
| 4th row | ASIA |
| 5th row | EUROPE |
| Value | Count | Frequency (%) |
| asia | 207299 | |
| europe | 201481 | |
| africa | 110915 | |
| latin_america | 84804 | |
| oceania | 58632 | 8.6% |
| north_america | 21173 | 3.1% |
| antarctica | 265 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1051245 | |
| I | 567892 | |
| E | 567571 | |
| R | 439811 | |
| O | 281286 | 6.3% |
| C | 276054 | 6.1% |
| S | 207299 | 4.6% |
| U | 201481 | 4.5% |
| P | 201481 | 4.5% |
| N | 164874 | 3.7% |
| Other values (6) | 535353 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4388370 | |
| Connector Punctuation | 105977 | 2.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1051245 | |
| I | 567892 | |
| E | 567571 | |
| R | 439811 | |
| O | 281286 | 6.4% |
| C | 276054 | 6.3% |
| S | 207299 | 4.7% |
| U | 201481 | 4.6% |
| P | 201481 | 4.6% |
| N | 164874 | 3.8% |
| Other values (5) | 429376 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 105977 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4388370 | |
| Common | 105977 | 2.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1051245 | |
| I | 567892 | |
| E | 567571 | |
| R | 439811 | |
| O | 281286 | 6.4% |
| C | 276054 | 6.3% |
| S | 207299 | 4.7% |
| U | 201481 | 4.6% |
| P | 201481 | 4.6% |
| N | 164874 | 3.8% |
| Other values (5) | 429376 |
Common
| Value | Count | Frequency (%) |
| _ | 105977 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4494347 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1051245 | |
| I | 567892 | |
| E | 567571 | |
| R | 439811 | |
| O | 281286 | 6.3% |
| C | 276054 | 6.1% |
| S | 207299 | 4.6% |
| U | 201481 | 4.5% |
| P | 201481 | 4.5% |
| N | 164874 | 3.7% |
| Other values (6) | 535353 |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 6.4 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EUROPE |
|---|---|
| 2nd row | EUROPE |
| 3rd row | EUROPE |
| 4th row | EUROPE |
| 5th row | EUROPE |
| Value | Count | Frequency (%) |
| europe | 836207 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1672414 | |
| U | 836207 | |
| R | 836207 | |
| O | 836207 | |
| P | 836207 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5017242 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1672414 | |
| U | 836207 | |
| R | 836207 | |
| O | 836207 | |
| P | 836207 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5017242 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1672414 | |
| U | 836207 | |
| R | 836207 | |
| O | 836207 | |
| P | 836207 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5017242 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1672414 | |
| U | 836207 | |
| R | 836207 | |
| O | 836207 | |
| P | 836207 |
level0Gid
Text
Missing 
| Distinct | 213 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 497950 |
| Missing (%) | 59.5% |
| Memory size | 6.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | IDN |
|---|---|
| 2nd row | IDN |
| 3rd row | IDN |
| 4th row | IDN |
| 5th row | IDN |
| Value | Count | Frequency (%) |
| nld | 95646 | |
| idn | 47409 | |
| mys | 24942 | 7.4% |
| tha | 16522 | 4.9% |
| png | 16302 | 4.8% |
| cmr | 12997 | 3.8% |
| gab | 12542 | 3.7% |
| phl | 10183 | 3.0% |
| civ | 8171 | 2.4% |
| aus | 6755 | 2.0% |
| Other values (203) | 86790 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 181334 | |
| D | 154536 | |
| L | 117053 | |
| I | 61415 | 6.1% |
| A | 55005 | 5.4% |
| M | 49631 | 4.9% |
| G | 46506 | 4.6% |
| S | 41947 | 4.1% |
| H | 35579 | 3.5% |
| C | 33733 | 3.3% |
| Other values (20) | 238038 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1014671 | |
| Decimal Number | 106 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 181334 | |
| D | 154536 | |
| L | 117053 | |
| I | 61415 | 6.1% |
| A | 55005 | 5.4% |
| M | 49631 | 4.9% |
| G | 46506 | 4.6% |
| S | 41947 | 4.1% |
| H | 35579 | 3.5% |
| C | 33733 | 3.3% |
| Other values (16) | 237932 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 53 | |
| 6 | 48 | |
| 7 | 3 | 2.8% |
| 1 | 2 | 1.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1014671 | |
| Common | 106 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 181334 | |
| D | 154536 | |
| L | 117053 | |
| I | 61415 | 6.1% |
| A | 55005 | 5.4% |
| M | 49631 | 4.9% |
| G | 46506 | 4.6% |
| S | 41947 | 4.1% |
| H | 35579 | 3.5% |
| C | 33733 | 3.3% |
| Other values (16) | 237932 |
Common
| Value | Count | Frequency (%) |
| 0 | 53 | |
| 6 | 48 | |
| 7 | 3 | 2.8% |
| 1 | 2 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1014777 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 181334 | |
| D | 154536 | |
| L | 117053 | |
| I | 61415 | 6.1% |
| A | 55005 | 5.4% |
| M | 49631 | 4.9% |
| G | 46506 | 4.6% |
| S | 41947 | 4.1% |
| H | 35579 | 3.5% |
| C | 33733 | 3.3% |
| Other values (20) | 238038 |
level0Name
Text
Missing 
| Distinct | 213 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 497950 |
| Missing (%) | 59.5% |
| Memory size | 6.4 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 9.879816945 |
| Min length | 4 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Indonesia |
|---|---|
| 2nd row | Indonesia |
| 3rd row | Indonesia |
| 4th row | Indonesia |
| 5th row | Indonesia |
| Value | Count | Frequency (%) |
| netherlands | 95646 | |
| indonesia | 47409 | 11.4% |
| malaysia | 24942 | 6.0% |
| guinea | 18245 | 4.4% |
| new | 17522 | 4.2% |
| thailand | 16522 | 4.0% |
| papua | 16302 | 3.9% |
| cameroon | 12997 | 3.1% |
| gabon | 12542 | 3.0% |
| philippines | 10183 | 2.5% |
| Other values (247) | 143215 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 454648 | |
| e | 373810 | 11.2% |
| n | 321392 | 9.6% |
| i | 234110 | 7.0% |
| s | 196756 | 5.9% |
| d | 182043 | 5.4% |
| l | 176460 | 5.3% |
| r | 166938 | 5.0% |
| o | 140867 | 4.2% |
| h | 140773 | 4.2% |
| Other values (53) | 954140 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2849741 | |
| Uppercase Letter | 404863 | 12.1% |
| Space Separator | 77266 | 2.3% |
| Other Punctuation | 9293 | 0.3% |
| Dash Punctuation | 762 | < 0.1% |
| Open Punctuation | 6 | < 0.1% |
| Close Punctuation | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 454648 | |
| e | 373810 | |
| n | 321392 | |
| i | 234110 | |
| s | 196756 | |
| d | 182043 | 6.4% |
| l | 176460 | 6.2% |
| r | 166938 | 5.9% |
| o | 140867 | 4.9% |
| h | 140773 | 4.9% |
| Other values (21) | 461944 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 116417 | |
| I | 57503 | |
| G | 38926 | 9.6% |
| M | 32252 | 8.0% |
| C | 31850 | 7.9% |
| P | 28831 | 7.1% |
| T | 21089 | 5.2% |
| B | 14678 | 3.6% |
| S | 12858 | 3.2% |
| E | 10142 | 2.5% |
| Other values (15) | 40317 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 8171 | |
| , | 1110 | 11.9% |
| . | 12 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 77266 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 762 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3254604 | |
| Common | 87333 | 2.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 454648 | |
| e | 373810 | |
| n | 321392 | 9.9% |
| i | 234110 | 7.2% |
| s | 196756 | 6.0% |
| d | 182043 | 5.6% |
| l | 176460 | 5.4% |
| r | 166938 | 5.1% |
| o | 140867 | 4.3% |
| h | 140773 | 4.3% |
| Other values (46) | 866807 |
Common
| Value | Count | Frequency (%) |
| 77266 | ||
| ' | 8171 | 9.4% |
| , | 1110 | 1.3% |
| - | 762 | 0.9% |
| . | 12 | < 0.1% |
| ( | 6 | < 0.1% |
| ) | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3332320 | |
| None | 9617 | 0.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 454648 | |
| e | 373810 | 11.2% |
| n | 321392 | 9.6% |
| i | 234110 | 7.0% |
| s | 196756 | 5.9% |
| d | 182043 | 5.5% |
| l | 176460 | 5.3% |
| r | 166938 | 5.0% |
| o | 140867 | 4.2% |
| h | 140773 | 4.2% |
| Other values (47) | 944523 |
None
| Value | Count | Frequency (%) |
| ô | 8171 | |
| é | 615 | 6.4% |
| ç | 458 | 4.8% |
| í | 185 | 1.9% |
| ã | 185 | 1.9% |
| Å | 3 | < 0.1% |
level1Gid
Text
Missing 
| Distinct | 2131 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 499035 |
| Missing (%) | 59.7% |
| Memory size | 6.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.506898515 |
| Min length | 6 |
Unique
| Unique | 318 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | IDN.30_1 |
|---|---|
| 2nd row | IDN.12_1 |
| 3rd row | IDN.30_1 |
| 4th row | IDN.30_1 |
| 5th row | IDN.29_1 |
| Value | Count | Frequency (%) |
| nld.14_1 | 20426 | 6.1% |
| nld.4_1 | 17955 | 5.3% |
| mys.13_1 | 12106 | 3.6% |
| nld.9_1 | 10925 | 3.2% |
| mys.14_1 | 8546 | 2.5% |
| nld.7_1 | 7831 | 2.3% |
| nld.10_1 | 7165 | 2.1% |
| nld.11_1 | 7053 | 2.1% |
| nld.3_1 | 6890 | 2.0% |
| idn.34_1 | 5494 | 1.6% |
| Other values (2121) | 232783 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 498400 | |
| _ | 337128 | |
| . | 335630 | |
| N | 181311 | 7.2% |
| D | 154522 | 6.1% |
| L | 117047 | 4.6% |
| 4 | 76894 | 3.0% |
| I | 61413 | 2.4% |
| 2 | 59174 | 2.3% |
| 3 | 56308 | 2.2% |
| Other values (28) | 653304 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1011554 | |
| Decimal Number | 846819 | |
| Connector Punctuation | 337128 | 13.3% |
| Other Punctuation | 335630 | 13.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 181311 | |
| D | 154522 | |
| L | 117047 | |
| I | 61413 | 6.1% |
| A | 54659 | 5.4% |
| M | 49326 | 4.9% |
| G | 46552 | 4.6% |
| S | 41703 | 4.1% |
| H | 35625 | 3.5% |
| C | 33252 | 3.3% |
| Other values (16) | 236144 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 498400 | |
| 4 | 76894 | 9.1% |
| 2 | 59174 | 7.0% |
| 3 | 56308 | 6.6% |
| 9 | 36734 | 4.3% |
| 0 | 28618 | 3.4% |
| 7 | 25205 | 3.0% |
| 8 | 23369 | 2.8% |
| 6 | 21357 | 2.5% |
| 5 | 20760 | 2.5% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 337128 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 335630 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1519577 | |
| Latin | 1011554 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 181311 | |
| D | 154522 | |
| L | 117047 | |
| I | 61413 | 6.1% |
| A | 54659 | 5.4% |
| M | 49326 | 4.9% |
| G | 46552 | 4.6% |
| S | 41703 | 4.1% |
| H | 35625 | 3.5% |
| C | 33252 | 3.3% |
| Other values (16) | 236144 |
Common
| Value | Count | Frequency (%) |
| 1 | 498400 | |
| _ | 337128 | |
| . | 335630 | |
| 4 | 76894 | 5.1% |
| 2 | 59174 | 3.9% |
| 3 | 56308 | 3.7% |
| 9 | 36734 | 2.4% |
| 0 | 28618 | 1.9% |
| 7 | 25205 | 1.7% |
| 8 | 23369 | 1.5% |
| Other values (2) | 42117 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2531131 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 498400 | |
| _ | 337128 | |
| . | 335630 | |
| N | 181311 | 7.2% |
| D | 154522 | 6.1% |
| L | 117047 | 4.6% |
| 4 | 76894 | 3.0% |
| I | 61413 | 2.4% |
| 2 | 59174 | 2.3% |
| 3 | 56308 | 2.2% |
| Other values (28) | 653304 |
level1Name
Text
Missing 
| Distinct | 2070 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 499035 |
| Missing (%) | 59.7% |
| Memory size | 6.4 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 9.465842562 |
| Min length | 3 |
Unique
| Unique | 312 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Sumatera Barat |
|---|---|
| 2nd row | Kalimantan Barat |
| 3rd row | Sumatera Barat |
| 4th row | Sumatera Barat |
| 5th row | Sulawesi Utara |
| Value | Count | Frequency (%) |
| zuid-holland | 19921 | 4.7% |
| gelderland | 17955 | 4.3% |
| kalimantan | 12145 | 2.9% |
| sabah | 12106 | 2.9% |
| barat | 11903 | 2.8% |
| noord-holland | 10925 | 2.6% |
| jawa | 9106 | 2.2% |
| sarawak | 8546 | 2.0% |
| timur | 8363 | 2.0% |
| limburg | 7831 | 1.9% |
| Other values (2239) | 302728 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 461320 | |
| n | 235409 | 7.4% |
| e | 207213 | 6.5% |
| r | 207212 | 6.5% |
| l | 194779 | 6.1% |
| o | 174582 | 5.5% |
| i | 162392 | 5.1% |
| d | 146654 | 4.6% |
| u | 138738 | 4.3% |
| t | 129029 | 4.0% |
| Other values (120) | 1134308 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2571860 | |
| Uppercase Letter | 475768 | 14.9% |
| Space Separator | 84355 | 2.6% |
| Dash Punctuation | 58213 | 1.8% |
| Other Punctuation | 1317 | < 0.1% |
| Open Punctuation | 59 | < 0.1% |
| Close Punctuation | 59 | < 0.1% |
| Modifier Symbol | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 461320 | |
| n | 235409 | |
| e | 207213 | |
| r | 207212 | |
| l | 194779 | 7.6% |
| o | 174582 | 6.8% |
| i | 162392 | 6.3% |
| d | 146654 | 5.7% |
| u | 138738 | 5.4% |
| t | 129029 | 5.0% |
| Other values (74) | 514532 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 60946 | |
| N | 39234 | 8.2% |
| H | 37269 | 7.8% |
| B | 34551 | 7.3% |
| M | 30561 | 6.4% |
| Z | 28371 | 6.0% |
| T | 26791 | 5.6% |
| G | 24375 | 5.1% |
| O | 23090 | 4.9% |
| K | 21541 | 4.5% |
| Other values (24) | 149039 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 923 | |
| ' | 325 | 24.7% |
| ! | 56 | 4.3% |
| . | 7 | 0.5% |
| / | 6 | 0.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 58 | |
| ( | 1 | 1.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 58 | |
| ) | 1 | 1.7% |
Space Separator
| Value | Count | Frequency (%) |
| 84355 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 58213 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3047628 | |
| Common | 144008 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 461320 | |
| n | 235409 | 7.7% |
| e | 207213 | 6.8% |
| r | 207212 | 6.8% |
| l | 194779 | 6.4% |
| o | 174582 | 5.7% |
| i | 162392 | 5.3% |
| d | 146654 | 4.8% |
| u | 138738 | 4.6% |
| t | 129029 | 4.2% |
| Other values (108) | 990300 |
Common
| Value | Count | Frequency (%) |
| 84355 | ||
| - | 58213 | |
| , | 923 | 0.6% |
| ' | 325 | 0.2% |
| [ | 58 | < 0.1% |
| ] | 58 | < 0.1% |
| ! | 56 | < 0.1% |
| . | 7 | < 0.1% |
| / | 6 | < 0.1% |
| ` | 5 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3162469 | |
| None | 28570 | 0.9% |
| Latin Ext Additional | 597 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 461320 | |
| n | 235409 | 7.4% |
| e | 207213 | 6.6% |
| r | 207212 | 6.6% |
| l | 194779 | 6.2% |
| o | 174582 | 5.5% |
| i | 162392 | 5.1% |
| d | 146654 | 4.6% |
| u | 138738 | 4.4% |
| t | 129029 | 4.1% |
| Other values (54) | 1105141 |
None
| Value | Count | Frequency (%) |
| é | 14382 | |
| â | 7294 | |
| í | 1163 | 4.1% |
| á | 1121 | 3.9% |
| ê | 657 | 2.3% |
| ó | 577 | 2.0% |
| ô | 433 | 1.5% |
| ì | 419 | 1.5% |
| ã | 329 | 1.2% |
| É | 283 | 1.0% |
| Other values (42) | 1912 | 6.7% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ồ | 185 | |
| ả | 72 | 12.1% |
| ẵ | 54 | 9.0% |
| ắ | 52 | 8.7% |
| ệ | 41 | 6.9% |
| ộ | 38 | 6.4% |
| ế | 37 | 6.2% |
| ừ | 37 | 6.2% |
| ậ | 30 | 5.0% |
| ạ | 18 | 3.0% |
| Other values (4) | 33 | 5.5% |
level2Gid
Text
Missing 
| Distinct | 8865 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 501994 |
| Missing (%) | 60.0% |
| Memory size | 6.4 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 9.939694508 |
| Min length | 7 |
Unique
| Unique | 2458 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | IDN.30.15_1 |
|---|---|
| 2nd row | IDN.12.14_1 |
| 3rd row | IDN.30.5_1 |
| 4th row | IDN.30.5_1 |
| 5th row | IDN.29.10_1 |
| Value | Count | Frequency (%) |
| nld.14.38_1 | 4144 | 1.2% |
| cmr.10.3_1 | 3732 | 1.1% |
| nld.14.67_2 | 2667 | 0.8% |
| civ.1.1_1 | 2431 | 0.7% |
| nld.4.44_1 | 1997 | 0.6% |
| idn.34.6_1 | 1970 | 0.6% |
| nld.14.84_1 | 1952 | 0.6% |
| nld.14.2_1 | 1880 | 0.6% |
| nld.9.4_1 | 1814 | 0.5% |
| png.14.1_1 | 1673 | 0.5% |
| Other values (8855) | 309955 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 666840 | |
| 1 | 590323 | |
| _ | 334215 | |
| N | 181156 | 5.5% |
| D | 154398 | 4.6% |
| 2 | 150376 | 4.5% |
| 4 | 127623 | 3.8% |
| 3 | 119036 | 3.6% |
| L | 116827 | 3.5% |
| I | 61334 | 1.8% |
| Other values (28) | 819867 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1318401 | |
| Uppercase Letter | 1002539 | |
| Other Punctuation | 666840 | |
| Connector Punctuation | 334215 | 10.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 181156 | |
| D | 154398 | |
| L | 116827 | |
| I | 61334 | 6.1% |
| A | 54491 | 5.4% |
| M | 48964 | 4.9% |
| G | 45690 | 4.6% |
| S | 39463 | 3.9% |
| H | 35575 | 3.5% |
| C | 32901 | 3.3% |
| Other values (16) | 231740 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 590323 | |
| 2 | 150376 | 11.4% |
| 4 | 127623 | 9.7% |
| 3 | 119036 | 9.0% |
| 6 | 60673 | 4.6% |
| 7 | 57260 | 4.3% |
| 9 | 57181 | 4.3% |
| 5 | 54603 | 4.1% |
| 8 | 52509 | 4.0% |
| 0 | 48817 | 3.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 666840 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 334215 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2319456 | |
| Latin | 1002539 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 181156 | |
| D | 154398 | |
| L | 116827 | |
| I | 61334 | 6.1% |
| A | 54491 | 5.4% |
| M | 48964 | 4.9% |
| G | 45690 | 4.6% |
| S | 39463 | 3.9% |
| H | 35575 | 3.5% |
| C | 32901 | 3.3% |
| Other values (16) | 231740 |
Common
| Value | Count | Frequency (%) |
| . | 666840 | |
| 1 | 590323 | |
| _ | 334215 | |
| 2 | 150376 | 6.5% |
| 4 | 127623 | 5.5% |
| 3 | 119036 | 5.1% |
| 6 | 60673 | 2.6% |
| 7 | 57260 | 2.5% |
| 9 | 57181 | 2.5% |
| 5 | 54603 | 2.4% |
| Other values (2) | 101326 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3321995 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 666840 | |
| 1 | 590323 | |
| _ | 334215 | |
| N | 181156 | 5.5% |
| D | 154398 | 4.6% |
| 2 | 150376 | 4.5% |
| 4 | 127623 | 3.8% |
| 3 | 119036 | 3.6% |
| L | 116827 | 3.5% |
| I | 61334 | 1.8% |
| Other values (28) | 819867 |
level2Name
Text
Missing 
| Distinct | 8635 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 501999 |
| Missing (%) | 60.0% |
| Memory size | 6.4 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 9.193734478 |
| Min length | 1 |
Unique
| Unique | 2322 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Pesisir Selatan |
|---|---|
| 2nd row | Sintang |
| 3rd row | Kepulauan Mentawai |
| 4th row | Kepulauan Mentawai |
| 5th row | Minahasa Selatan |
| Value | Count | Frequency (%) |
| leiden | 4144 | 0.9% |
| kota | 4103 | 0.9% |
| de | 3968 | 0.9% |
| océan | 3732 | 0.8% |
| kutai | 3500 | 0.8% |
| timur | 3319 | 0.7% |
| city | 2951 | 0.7% |
| rotterdam | 2667 | 0.6% |
| et | 2526 | 0.6% |
| abidjan | 2431 | 0.5% |
| Other values (8887) | 415746 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 382262 | 12.4% |
| e | 266254 | 8.7% |
| n | 240185 | 7.8% |
| o | 187854 | 6.1% |
| r | 167643 | 5.5% |
| i | 166065 | 5.4% |
| u | 144544 | 4.7% |
| t | 118217 | 3.8% |
| 114877 | 3.7% | |
| l | 107136 | 3.5% |
| Other values (157) | 1177601 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2471835 | |
| Uppercase Letter | 453959 | 14.8% |
| Space Separator | 114877 | 3.7% |
| Dash Punctuation | 23094 | 0.8% |
| Other Punctuation | 4972 | 0.2% |
| Decimal Number | 2213 | 0.1% |
| Open Punctuation | 856 | < 0.1% |
| Close Punctuation | 675 | < 0.1% |
| Math Symbol | 88 | < 0.1% |
| Modifier Symbol | 69 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 382262 | |
| e | 266254 | |
| n | 240185 | |
| o | 187854 | 7.6% |
| r | 167643 | 6.8% |
| i | 166065 | 6.7% |
| u | 144544 | 5.8% |
| t | 118217 | 4.8% |
| l | 107136 | 4.3% |
| g | 97879 | 4.0% |
| Other values (92) | 593796 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 50216 | 11.1% |
| B | 41287 | 9.1% |
| K | 40415 | 8.9% |
| S | 35291 | 7.8% |
| T | 31125 | 6.9% |
| L | 24861 | 5.5% |
| A | 23385 | 5.2% |
| C | 19951 | 4.4% |
| N | 19626 | 4.3% |
| R | 19505 | 4.3% |
| Other values (32) | 148297 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 798 | |
| 1 | 487 | |
| 8 | 328 | |
| 7 | 209 | 9.4% |
| 0 | 187 | 8.5% |
| 3 | 124 | 5.6% |
| 6 | 30 | 1.4% |
| 2 | 21 | 0.9% |
| 5 | 17 | 0.8% |
| 4 | 12 | 0.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 3095 | |
| . | 1138 | 22.9% |
| / | 374 | 7.5% |
| , | 254 | 5.1% |
| & | 72 | 1.4% |
| # | 33 | 0.7% |
| ? | 6 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 114877 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 23094 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 856 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 675 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 88 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 69 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2925794 | |
| Common | 146844 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 382262 | 13.1% |
| e | 266254 | 9.1% |
| n | 240185 | 8.2% |
| o | 187854 | 6.4% |
| r | 167643 | 5.7% |
| i | 166065 | 5.7% |
| u | 144544 | 4.9% |
| t | 118217 | 4.0% |
| l | 107136 | 3.7% |
| g | 97879 | 3.3% |
| Other values (134) | 1047755 |
Common
| Value | Count | Frequency (%) |
| 114877 | ||
| - | 23094 | 15.7% |
| ' | 3095 | 2.1% |
| . | 1138 | 0.8% |
| ( | 856 | 0.6% |
| 9 | 798 | 0.5% |
| ) | 675 | 0.5% |
| 1 | 487 | 0.3% |
| / | 374 | 0.3% |
| 8 | 328 | 0.2% |
| Other values (13) | 1122 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3044873 | |
| None | 27150 | 0.9% |
| Latin Ext Additional | 606 | < 0.1% |
| IPA Ext | 9 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 382262 | 12.6% |
| e | 266254 | 8.7% |
| n | 240185 | 7.9% |
| o | 187854 | 6.2% |
| r | 167643 | 5.5% |
| i | 166065 | 5.5% |
| u | 144544 | 4.7% |
| t | 118217 | 3.9% |
| 114877 | 3.8% | |
| l | 107136 | 3.5% |
| Other values (65) | 1149836 |
None
| Value | Count | Frequency (%) |
| é | 15359 | |
| è | 1744 | 6.4% |
| â | 1202 | 4.4% |
| É | 1058 | 3.9% |
| ô | 1018 | 3.7% |
| í | 1011 | 3.7% |
| ú | 913 | 3.4% |
| á | 886 | 3.3% |
| ñ | 666 | 2.5% |
| ó | 631 | 2.3% |
| Other values (56) | 2662 | 9.8% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ạ | 115 | |
| ả | 93 | |
| ủ | 74 | |
| ế | 70 | |
| ộ | 53 | |
| ậ | 47 | |
| ầ | 28 | 4.6% |
| ồ | 25 | 4.1% |
| ắ | 21 | 3.5% |
| ợ | 17 | 2.8% |
| Other values (15) | 63 |
IPA Ext
| Value | Count | Frequency (%) |
| ə | 9 |
level3Gid
Text
Missing 
| Distinct | 10466 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 695853 |
| Missing (%) | 83.2% |
| Memory size | 6.4 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 12.10025222 |
| Min length | 11 |
Unique
| Unique | 3314 ? |
|---|---|
| Unique (%) | 2.4% |
Sample
| 1st row | IDN.30.15.10_1 |
|---|---|
| 2nd row | IDN.12.14.4_1 |
| 3rd row | IDN.30.5.9_1 |
| 4th row | IDN.30.5.9_1 |
| 5th row | IDN.29.10.3_1 |
| Value | Count | Frequency (%) |
| civ.1.1.1_1 | 2431 | 1.7% |
| civ.14.2.2_1 | 1100 | 0.8% |
| cmr.10.3.2_1 | 1089 | 0.8% |
| idn.9.16.3_1 | 919 | 0.7% |
| idn.34.6.15_1 | 770 | 0.5% |
| civ.2.1.2_1 | 762 | 0.5% |
| idn.22.5.10_1 | 758 | 0.5% |
| tza.13.10.27_1 | 647 | 0.5% |
| cmr.10.3.4_1 | 621 | 0.4% |
| cmr.10.3.6_1 | 617 | 0.4% |
| Other values (10456) | 130642 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 421068 | |
| 1 | 301027 | |
| _ | 140356 | 8.3% |
| 2 | 95511 | 5.6% |
| 3 | 71465 | 4.2% |
| I | 60394 | 3.6% |
| N | 58352 | 3.4% |
| D | 53842 | 3.2% |
| 4 | 52782 | 3.1% |
| 5 | 38084 | 2.2% |
| Other values (26) | 405462 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 715957 | |
| Other Punctuation | 421068 | |
| Uppercase Letter | 420962 | |
| Connector Punctuation | 140356 | 8.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 60394 | |
| N | 58352 | |
| D | 53842 | |
| H | 33851 | |
| T | 27392 | 6.5% |
| C | 25011 | 5.9% |
| A | 24690 | 5.9% |
| M | 21474 | 5.1% |
| R | 20395 | 4.8% |
| E | 18212 | 4.3% |
| Other values (14) | 77349 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 301027 | |
| 2 | 95511 | 13.3% |
| 3 | 71465 | 10.0% |
| 4 | 52782 | 7.4% |
| 5 | 38084 | 5.3% |
| 6 | 36571 | 5.1% |
| 9 | 33035 | 4.6% |
| 0 | 32171 | 4.5% |
| 7 | 28483 | 4.0% |
| 8 | 26828 | 3.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 421068 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 140356 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1277381 | |
| Latin | 420962 | 24.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 60394 | |
| N | 58352 | |
| D | 53842 | |
| H | 33851 | |
| T | 27392 | 6.5% |
| C | 25011 | 5.9% |
| A | 24690 | 5.9% |
| M | 21474 | 5.1% |
| R | 20395 | 4.8% |
| E | 18212 | 4.3% |
| Other values (14) | 77349 |
Common
| Value | Count | Frequency (%) |
| . | 421068 | |
| 1 | 301027 | |
| _ | 140356 | 11.0% |
| 2 | 95511 | 7.5% |
| 3 | 71465 | 5.6% |
| 4 | 52782 | 4.1% |
| 5 | 38084 | 3.0% |
| 6 | 36571 | 2.9% |
| 9 | 33035 | 2.6% |
| 0 | 32171 | 2.5% |
| Other values (2) | 55311 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1698343 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 421068 | |
| 1 | 301027 | |
| _ | 140356 | 8.3% |
| 2 | 95511 | 5.6% |
| 3 | 71465 | 4.2% |
| I | 60394 | 3.6% |
| N | 58352 | 3.4% |
| D | 53842 | 3.2% |
| 4 | 52782 | 3.1% |
| 5 | 38084 | 2.2% |
| Other values (26) | 405462 |
level3Name
Text
Missing 
| Distinct | 9920 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 697659 |
| Missing (%) | 83.4% |
| Memory size | 6.4 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 8.747542403 |
| Min length | 2 |
Unique
| Unique | 3029 ? |
|---|---|
| Unique (%) | 2.2% |
Sample
| 1st row | Pancung Soal |
|---|---|
| 2nd row | Kayan Hilir |
| 3rd row | Sipora Selatan |
| 4th row | Sipora Selatan |
| 5th row | Amurang |
| Value | Count | Frequency (%) |
| selatan | 2482 | 1.3% |
| abidjan | 2431 | 1.2% |
| tengah | 2360 | 1.2% |
| utara | 1834 | 0.9% |
| ban | 1662 | 0.8% |
| n.a | 1613 | 0.8% |
| barat | 1608 | 0.8% |
| mae | 1569 | 0.8% |
| 1 | 1529 | 0.8% |
| na | 1448 | 0.7% |
| Other values (9982) | 177214 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 199433 | |
| n | 96033 | 7.9% |
| o | 74905 | 6.2% |
| i | 66728 | 5.5% |
| e | 61766 | 5.1% |
| u | 60137 | 5.0% |
| 57200 | 4.7% | |
| r | 54108 | 4.5% |
| g | 41770 | 3.4% |
| l | 37532 | 3.1% |
| Other values (132) | 462360 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 938705 | |
| Uppercase Letter | 192847 | 15.9% |
| Space Separator | 57200 | 4.7% |
| Decimal Number | 9632 | 0.8% |
| Other Punctuation | 5353 | 0.4% |
| Dash Punctuation | 3729 | 0.3% |
| Open Punctuation | 2281 | 0.2% |
| Close Punctuation | 2225 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 199433 | |
| n | 96033 | |
| o | 74905 | 8.0% |
| i | 66728 | 7.1% |
| e | 61766 | 6.6% |
| u | 60137 | 6.4% |
| r | 54108 | 5.8% |
| g | 41770 | 4.4% |
| l | 37532 | 4.0% |
| t | 32903 | 3.5% |
| Other values (74) | 213390 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 23353 | |
| B | 20810 | |
| T | 18280 | |
| M | 17566 | |
| K | 15980 | 8.3% |
| P | 12777 | 6.6% |
| A | 12574 | 6.5% |
| N | 9940 | 5.2% |
| C | 9519 | 4.9% |
| L | 9004 | 4.7% |
| Other values (25) | 43044 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3088 | |
| 2 | 2093 | |
| 4 | 1397 | |
| 6 | 789 | 8.2% |
| 3 | 732 | 7.6% |
| 0 | 397 | 4.1% |
| 5 | 358 | 3.7% |
| 8 | 292 | 3.0% |
| 9 | 288 | 3.0% |
| 7 | 198 | 2.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3464 | |
| ' | 1197 | 22.4% |
| / | 563 | 10.5% |
| , | 106 | 2.0% |
| ! | 14 | 0.3% |
| \ | 4 | 0.1% |
| : | 3 | 0.1% |
| * | 1 | < 0.1% |
| ? | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 57200 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3729 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2281 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2225 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1131552 | |
| Common | 80420 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 199433 | |
| n | 96033 | 8.5% |
| o | 74905 | 6.6% |
| i | 66728 | 5.9% |
| e | 61766 | 5.5% |
| u | 60137 | 5.3% |
| r | 54108 | 4.8% |
| g | 41770 | 3.7% |
| l | 37532 | 3.3% |
| t | 32903 | 2.9% |
| Other values (109) | 406237 |
Common
| Value | Count | Frequency (%) |
| 57200 | ||
| - | 3729 | 4.6% |
| . | 3464 | 4.3% |
| 1 | 3088 | 3.8% |
| ( | 2281 | 2.8% |
| ) | 2225 | 2.8% |
| 2 | 2093 | 2.6% |
| 4 | 1397 | 1.7% |
| ' | 1197 | 1.5% |
| 6 | 789 | 1.0% |
| Other values (13) | 2957 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1200460 | |
| None | 10685 | 0.9% |
| Latin Ext Additional | 827 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 199433 | |
| n | 96033 | 8.0% |
| o | 74905 | 6.2% |
| i | 66728 | 5.6% |
| e | 61766 | 5.1% |
| u | 60137 | 5.0% |
| 57200 | 4.8% | |
| r | 54108 | 4.5% |
| g | 41770 | 3.5% |
| l | 37532 | 3.1% |
| Other values (65) | 450848 |
None
| Value | Count | Frequency (%) |
| é | 6399 | |
| è | 1009 | 9.4% |
| ơ | 414 | 3.9% |
| ư | 410 | 3.8% |
| ú | 369 | 3.5% |
| ï | 312 | 2.9% |
| ñ | 261 | 2.4% |
| á | 198 | 1.9% |
| í | 170 | 1.6% |
| ê | 155 | 1.5% |
| Other values (34) | 988 | 9.2% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ả | 143 | |
| ạ | 111 | |
| ố | 88 | |
| ộ | 77 | |
| ọ | 55 | 6.7% |
| ế | 55 | 6.7% |
| ằ | 41 | 5.0% |
| ị | 40 | 4.8% |
| ờ | 33 | 4.0% |
| ậ | 32 | 3.9% |
| Other values (13) | 152 |
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 73123 |
| Missing (%) | 8.7% |
| Memory size | 6.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NE |
|---|---|
| 2nd row | CR |
| 3rd row | NE |
| 4th row | EN |
| 5th row | NE |
| Value | Count | Frequency (%) |
| ne | 566671 | |
| lc | 172272 | 22.6% |
| vu | 7992 | 1.0% |
| nt | 6121 | 0.8% |
| en | 4663 | 0.6% |
| dd | 3658 | 0.5% |
| cr | 1620 | 0.2% |
| ew | 45 | < 0.1% |
| ex | 44 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 577455 | |
| E | 571423 | |
| C | 173892 | 11.4% |
| L | 172272 | 11.3% |
| V | 7992 | 0.5% |
| U | 7992 | 0.5% |
| D | 7316 | 0.5% |
| T | 6121 | 0.4% |
| R | 1620 | 0.1% |
| W | 45 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1526172 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 577455 | |
| E | 571423 | |
| C | 173892 | 11.4% |
| L | 172272 | 11.3% |
| V | 7992 | 0.5% |
| U | 7992 | 0.5% |
| D | 7316 | 0.5% |
| T | 6121 | 0.4% |
| R | 1620 | 0.1% |
| W | 45 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1526172 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 577455 | |
| E | 571423 | |
| C | 173892 | 11.4% |
| L | 172272 | 11.3% |
| V | 7992 | 0.5% |
| U | 7992 | 0.5% |
| D | 7316 | 0.5% |
| T | 6121 | 0.4% |
| R | 1620 | 0.1% |
| W | 45 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1526172 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 577455 | |
| E | 571423 | |
| C | 173892 | 11.4% |
| L | 172272 | 11.3% |
| V | 7992 | 0.5% |
| U | 7992 | 0.5% |
| D | 7316 | 0.5% |
| T | 6121 | 0.4% |
| R | 1620 | 0.1% |
| W | 45 | < 0.1% |